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Series Editor’s Preface 


‘Approach your problems from the 
right end and begin with the an- 
swers. Then one day, perhaps you 
will find the final question. 


‘The Hermit Clad in Crane Fea- 
thers’ in R. Van Gultk’s The Chinese 


It isn’t that they can’t see the so- 
hition. 

It is that they can’t see the prob- 
lem. 


G. K. Chesterton, The Scandal of 
Father Brown‘Vhe Point of a Pin’. 


Maze Murders. 


Growing specialization and diversification have brought a host of monographs 
and textbooks on increasingly specialized topics. However, the ‘‘tree”’ of know- 
ledge of mathematics and related fields does not grow only by putting forth 
new branches. It also happens, quite often in fact, that branches which were 
thought to be completely disparate are suddenly seen to be related. 

Further, the kind and level of sophistication of mathematics applied in 
various sciences has changed drastically in recent years: measure theory is 
used (non-trivially) in regional and theoretical economics; algebraic geometry 
interacts with physics; the Minkowski lemma, coding theory and the struc- 
ture of water meet one another in packing and covering theory; quantum fields, 
crystal defects and mathematical programming profit from homotopy theory; 
Lie algebras are relevant to filtering; and prediction and electrical engineering 
can use Stein spaces. And in addition to this there are such new emerging 
subdisciplines as “experimental mathematics”, “CFD’’, “completely integrable 
systems’’, “‘chaos, synergetics and large-scale order’, which are almost impossible 
to fit into the existing classification schemes. They draw upon widely differ- 
ent sections of mathematics. This programme, Mathematics and its Applications, 
is devoted to new emerging (sub)disciplines and to such (new) interrelations 
as exempla gratia: 

—a central concept which plays an important role in several different mathe- 
matical and/or scientific specialized areas; 

— new applications of the results and ideas from one area of scientific endeavor 
into another; 

— influences which the results, problems and concepts of one field of enquiry have 
and have had on the development of another. 

The Mathematics and Its Applications programme tries to make available a 
careful selection of books which fit the philosophy outlined above. With such 
books, which are stimulating rather than definitive, intriguing rather than 
encyclopaedic, we hope to contribute something towards better communication 
among the practitioners in diversified fields. 

Because of the wealth of scholarly research being undertaken in the Soviet 
Union, Eastern Europe, and Japan, it was decided to devote special attention 
to the work emanating from these particular regions. 

Thus, it was decided to start three regional serics under the umbrella of the 
main MJA programme. 

The present volume in the Eastern Europe series can perhaps be best described 
as a supereconomical complete course in those parts of mathematics which 
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is required for understanding classical and modern mathematical physics and 
which is needed in order to work in these fields. Thus starting with real num- 
bers the author takes about 100 pages to deal with the calculus of variations 
and the principles of classical mechanics. Some 150 pages later, after dealing 
with function theory and other prerequisites, we are equipped to deal with 
hydrodynamics; then follow geometry, PDEs, operators and distributions, and 
on that basis classical field theory can be treated. Special relativity and electro- 
dynamics follow and are interlaced with other bits of necessary mathema- 
tics, such as tensors, differential forms and distributions on manifolds. The final 
250 pages deal with quantum mechanics, general relativity, including black ho- 
les and cosmology, wave equations in a curved space-time, singularities and 
catastrophes, and their physical applications. 


In a period wherein the ages-old “marriage” physics-mathematies has taken 
on new life, this seems a singularly useful volume. Mathematically trained 
professionals and students of mathematics will find here an almost royal road 
to physics and for physicists the book has been so written that it can be used as 
a reference volume for the mathematics needed. 


The unreasonable effectiveness of As long as algebra and geometry 
mathematics in science... proceeded along separate paths, their 
Eugene Wigner advance was slow and their appli- 

cations limited. 

Well, if you knows of a better ‘ole, But when these sciences joined 

go to it. company, they drew from each other 

Bruce Bairnsfather fresh vitality and thenceforward 

What is now proved was once only matched Ou Sessa suze exotds 

perfection. 


imagined, 


Maula Blas Joseph Louis Lagrange 


Bussum, April 1986 Michiel Hazewinkel 


Preface to the German Edition 


From 1974 to 1979 I had an opportunity, certainly one out of the ordinary, 
to read a conrse of lectures extending continuously over ten terms for students 
of mathematics at the Friedrich Schiller University, Jena. According to the 
curriculnm these lectures had different names (differential and integral calculus, 
ordinary differential equations, ete.), but certainly the content as well as the 
objects are best expressed by “Analysis and Mathematical Physics”. This book 
presents the extended skeleton of this course; a skeleton insofar as proofs have 
largely been omitted (in contrast to major parts of the lectures). On the other 
hand, Chaps. 27, 32, and 33 have been added subsequently. 

The object of the course becomes clear from having a look at the Table of 
Contents of this book: on the one hand mathematics has developed grandiose, 
elegant, consistent theories, which need no further justification. On the other 
hand, frequently it is just the most beautiful of these theories which also form 
the foundation on which classical as well as modern theoretical physics are 
based. It was the object not only to describe these foundations, but also to give 
an impression of the frames that can be erected on them. True to Hilbert’s 
ideal, mathematical theories are carefully separated here from their physical 
interpretations. 

This book is addressed to mathematicians, physicists, and students of mathe- 
matics and physics. Especially the mathematical chapters have been so written 
that they can also serve as a book of reference. Mathematicians will find the 
principles of classical and modern theoretical physics presented in a language 
familiar to them. By the concise description of some mathematical foundations 
of classical and modern theoretical physics, the book is hoped to be also of use 
for physicists. 

Most recently there has been a reapproach between theoretical physics and 
mathematics. This book is intended to promote this trend: among mathemati- 
cians and physicists, but especially among students of both disciplines. 

Having told what this book may possibly accomplish, let us refer to what 
it is not capable of doing. It is neither a textbook nor a collection of concise 
monographs. A student of mathematics will not be spared cutting his way, line 
by line, in suitable textbooks through the proofs of those theorems which are 
only formulated and commented here. Some passages in this book will serve 
their purpose if they give an appetite for further study. For further details and 
proofs the reader is referred to the quoted references. 

Let us also point out another peculiarity that is connected with the fact 
that methodical aspects have to be considered in a 10-terms course. (The lectures 
were compulsory up to the 5th term, which corresponds to Chap. 19, for all 
students of the class concerned, while the advanced course was compulsory 
only for the specialists in analysis. The final part of the course was facultative.) 
Some of the subjects appear repeatedly on ascending levels of abstraction. By 
way of example, Chap. 19 presents the classical foundations of the theory of 
partial differential equations, without any embroidery or modern accessories. 
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Who wants to know more must embark on a deeper study. After the theory of 
distributions has been presented, partial differential equations are dealt with 
again on this basis in Chap. 23. Finally, the wave equation in curved space-times 
is investigated in Chap. 33. This is based on the preceding Chaps. 29 and 32 on 
geometry. A look at the Table of Contents shows that there are other chains of 
this kind, such as the stepwise development of the integral concept, or the 
chapters on tensors, forms, and differential geometry. It has been a principle of 
this course to develop, on the one hand, the fields of mathematics presented in an 
exact and (as far as possible) gapless way, but, on the other hand, to aim just 
at that degree of generality which is absolutely necessary for the understanding 
of the subsequent physical chapters. Sometimes intuitively geometric arguments 
are given preference over formally logical conclusions. This is not only a question 
of time and space but also a matter of personal taste. 

The course covers a relatively wide field, and it is almost a matter of course 
that at many points the author could not give presentations of his own that 
might make pretence to originality. Many chapters are an adapted interpreta- 
tion of corresponding monographs. This is the case already with Chaps. 13 and 
14, which were read and written following closely the lines of P. R. Halmos [23]. 
Other examples are Chaps. 32/33 (F.G. Friedlander, [17]), Chap. 34 (Y.-C. Lu, 
{40]), and Chap. 28 ([66]). 

At the end of the book the reader will find a bibliography and references 
ordered by chapters. They are intended as acknowledgements, but also as 
recommendations for deeper studies. 

Finally, I wish to express my gratitude to the Teubner Verlagsgesellschaft, 
and particularly to Mrs. Ziegler, for their harmonious co-operation. 


Jena, in the fall of 1979 Hans Triebel 


Preface to the English Edition 


In comparison with the German editions we added few new references. 
Furthermore we included more quotations of eminent mathematicians and 
scientists, mostly about physics, either as epigraphs or as remarks. 


Jena, Spring 1986 
Hans Triebel 


Table of Contents 


Series Editor’s Preface 
Preface to the German Edition 


Preface to the English Edition 


1. Numbers and Spaces 1 

1.1. Real Numbers 1 

Hotta Number Systems 1 

iL tleee Distance and Axiom of Completeness 2 
ee Complex Numbers 3 

122.1 Definition 3 

Bae Properties 3 

lta Conjugate Elements, Subtraction and Division 3 
1.2.4. Normal Representation 4 

1.3. RR, Cy, and Metric Spaces 5 

Hasisil The x-Dimensional Real Space R,, 5 
lB The n- Dimensional Complex Space C,, 6 


1.3.3. The Metric Space 6 


2. Convergence and Continuity 7 
2plle Sequences 7 

ie Iles Infimum, Supremum, and Limit 7 
alee Properties of Convergent Sequences 8 
Pollesia Examples 9 

Zaoe Series 9 

Zadok Convergence and Divergence 9 

Met Examples 10 

Bee Criteria of Convergence 10 


2.2.4, Rearrangement, Multiplication, and Addition 11 


2.3. Real Functions in R, 12 
DBolle Definitions 12 


2.3.2. Properties of Continuous Functions 14 


Contents AL 


2.4. Continuous Mappings in Metric Spaees 15 
2.4.1. Definition 15 

242, Examples 16 

2.4.3. Real- Valued Continuous Functions in R, 17 
2.5. Complete Metrie Spaces 18 


eae Definitions 18 
DY, The Space C[a, 6] 18 
2 Banach’s Contraction Mapping Theorem 19 


or 
sigh 
. 


a 
te 


eo 


Differential and Integral Calculus in R, (Basie Concepts) 20 
3.1. Differentiation 20 


Rolled. Definitions 20 
Poll BX Rules 21 
es 


3. Examples (Rational Functions) 21 

3.1.4. Inverse Functions 22 

Balla. Mean Value Theorems 22 

Boll fh, Higher-Order Derivatives; Derivatives of Complex-Valued Functions 23 
Bea Integration of Real-Valued Funetions 24 

Bowell e Definition of Riemann’s Integral 24 

Beer Properties 25 

Beoae Commutation of Passage to the Limit and Integration 25 

3.2.4. Examples of Integrable Functions, and Counter-Examples 26 

Bee: Primitives 26 


Bvalb, Integral Operators 27 


4. Ordinary Differential Equations (Existence and Uniqueness 
Theorems) 29 


Holle Initial-Value Problems 29 

aol. The Differential Equation /(x)=0 29 

4.1.2. Formulation of the Problems 30 

4.2. Existence and Uniqueness Theorems 31 

a2 lle Systems of First Order 31 

AL Differential Equations of Order n 3 

4.2.3. Local Existence and Uniqueness Theorems 31 

5. Elementary Functions and Power Series 32 
5.1. Exponential Functions and Power Functions (Real-Valued) 32 
Hilal The Function c* 32 

Holes The Function log x 33 


Contents 


The Number e 33 
The Functions a* and log, x 33 
The Function x#* 34 


Trigonometric Functions 35 


The Functions sin x and cos x 35 

The Functions tan x and cot 2 36 

The Functions arecsin x and arctan x 36 
The Function ei? 37 


Exponential Functions and Power Functions (Complex-Vatued) 38 
The Functions e* and In z 38 

The Function 2”, Riemann Surfaces 39 

Roots of Unity, Fundamental Theorem of Algebra 40 

Power Series 41 

Radius of Convergence 41 

Addition and Multiplication of Power Series 42 

Differentiation of Function Sequences and Power Series 42 

Taylor Series 43 

Examples of, and Counter-Examples to Taylor Series 44 


Power Series for e?, Analytic Functions 44 
Irrationality of e 45 


Banach Spaces 45 
Definitions and Examples 45 


Definitions 45 
Examples 46 


Spaces of Type L, 46 


Inequalities 46 
The Spaces 15, n, Up,0; Ip,n, and lp,¢ 47 


Integral Calculus in R, (continued) 48 
Classes of Integrable Functions 48 
General Rules (Integration by Parts, Change of the Variable) 48 


Integration of Rational Functions, Partial Fraction Decomposition 49 
Integration of R(cos x, sin x) 50 


Integration of R(e*), LG, Vx2— 1), and F(a, Vx2+1) 51 
Integration of R(x, V1—a2) 52 
i 


I ti {Rk i) 52 
nte ration o 
g x, d 


a] 
° 
i) 
° 


Contents 
Improper Integrals 52 


Types of Improper Integrals, Examples 52 
Integral Convergenee Test for Series, Euler-Maseheroni Constant 54 
The 1’-Function 54 


Differential Caleulus in R,, 55 
Partial Derivatives 55 


Definition 55 

Commutativity of Partial Derivatives 56 

Taylor Polynomials 57 

n-Dimensional Power Series 57 

Curves and Surfaces in R,, Chain Rule 58 

Geometric Interpretation of the Taylor Polynomial 59 
Directional Derivative 60 


Implieit Functions and Theorems of Implicit Functions 61 


Formulation of the Problem 61 

Theorem of Implicit Functions, Curvilinear Co-Ordinates 62 
Theorem of Parameter-Depending Implicit Funetions 63 
Implicit Funetions 63 


Extreme Values of Functions 64 


The One-Dimensional! Case 64 
The n-Dimensional Case 65 


Integral Caleulus in R,, 65 


Definitions and Properties 65 


Q-Domains and J-Domains 65 

Integrals over G-Domains 66 

Properties 67 

Integrable Functions 67 

Integrals over J-Domains 68 

Iteration Theorem for n-Dimensional Integrals 68 


Transformation Formulas, Volume and Area Measurement 69 


Volume Measurement 69 
Transformation Formulas 70 
Are Length of Curves 70 
Area Measurement 71 
Surface Integrals 72 

The Unit Ball, P (5) 73 


Improper Integrals 74 


XII 


ATV Contents 


9.3. Integral Theorems 74 

9.3.1. Gauss’ Theorem 74 

O34 Green’s Formulas 75 

10. Ordinary Differential Equations (Methods of Solution) 76 
10.1. Separable, Homogeneous, aud Exact Differential Equations 76 


oY 


10.1.1. Formulation of the Problem 76 

10.1.2. Separable Differential Equations 76 
10.1.3. Homogeneous Differential Equations 78 
10.1.4. Exact Differential Equations 78 

10.1.5, Integrating Factor 79 


10.2. Linear Differential Equations of First Order 80 


10.2.1. The Equation y=f(x)y 80 
10.2.2. The Inhomogeneous Linear Differential Equation 80 


10.3. Linear Systems of Differential Equations of First Order 81 


10.3.1. | Fundamental Systems and Wronski’s Determinant 81 

10.3.2. Inhomogeneous Systems of Differential Equations 82 

10.3.3. Special Systems of Differential Equations 83 

10.3.4. Systems of Differential Equations with Constant Coefficients 83 


10.4. Linear Differential Equations of Higher Order 84 


10.4.1. Formulation of the Problem 84 
10.4.2. Fundamental Systems and Wronski’s Determinant 85 
10.4.3. Differential Equations with Constant Coefficients 85 


10.5. Continuous Dependence on Initial Values 86 
10.5.1. Systems of Differential Equations of First Order 86 


© 10.5.2. Differential Equations of Order n 86 
10.5.3. Continuous Dependence on the Right-Hand-Side 87 


it, Calculus of Variations 88 
11.1. Fundamental Equations of the Calculus of Variations 88 


11.1.t. Formulation of the Problems 88 
11.1.2. Preparatory Considerations 89 
With! The Euler-Lagrange Equations 89 


eS. Exampies 90 


11.2.1. A Preliminary Physical Remark 90) 
11.2.2. The Brachistochrone 91 


11.2.3. The Problem of the Straight Line as the Shortest Line Connecting Two Points 92 
11.2.4. Rotation-Symmetric Minimal Surfaces 93 


— ee 


Contents 
Principles of Classical Mechanies 94 
Modelling in Physies 94 


On the Relationship between Mathematics and Physics 94 
Mathematical Models 96 

Criteria for Models 96 

An Example 97 


The Model for Point Mechanies 98 


Hamilton's Principle 98 
An Example (Free Fall) 99 
The First Integral of Motion 99 


Systems of 2 Mass Points 99 


The Basic Model 99 

Force-Free Systems 100 

Conservative Systems 101 

Particle in a Potential Well, Harmonic Oscillator 101 


Planetary Motion 103 


Formulation of the Problem and Basic Model 103 
Plane Orbits, Kepler’s Second Law 105 

Kepler’s First Law 105 

Kepler’s Third Law 106 


Measure Theory 106 


Classes of Sets 106 


Algebras and o-Algebras 106 
Extension Theorems 107 
Borel] Sets in FR, 108 


Elementary Measures and Measures 108 


Definitions 108 

Propertics 109 

Heine-Borel’s Theorem 110 

Elementary Borel Measures in ft, 110 
Elementary Lebesgue Measure in &, 111 


The Onter Measure, Extension of Elementary Measures 112 


The Outer Measure 112 

The Induced Measure 113 

The Extension Theorem 113 

Borel, Lebesgue and Dirac Measures 114 
Uniqueness Theorems 115 


XV 


XVI Contents 
13.4. Measurable Functions 115 


13.4.1. Definition 115 
13.4.2. Properties of Measurable Functions 116 
13.4.3. Sequences of Measurable Functions 116 


13.4.4. Convergence Almost Everywhere, Convergence in Measure 118 
14. Integration Theory 120 
14.1. integrable Functions, Properties of Integrals 120 


14.1.1. Integrable Step Functions 120 

14.1.2. Integrable Functions 121 

14.1.3. Properties of Integrable Functions 121 
14.1.4. Properties of Integrals 122 


14.2. Fundamental Theorems of Integration Theory 122 


14.2.1. Convergence in L, 122 

14.2.2. Lebesgue’s Bounded Convergence Theorem 123 
14.2.3. Other Properties of Integrable Functions 123 
14.2.4. The Banach Space £,(X, %, gs) 124 

14.2.5. The Theorems of B. Levi and Fatou 125 


14.3. Transformation Formulas 125 


14.3.1. | Measurable Mappings and Image Measures 125 

14.3.2. A Special Transformation Formula 126 

14.3.3. Absolutely Continuous Measures, Theorem of Radon-Nikodym 126 
14.3.4. The General Transformation Formula 127 


14.4. Product Measures, Fubini’s Theorem 127 


14.4.1. © The o-Algebra in a Product Space, Measurable Intersections 127 
14.4.2. The Product Measure 128 

14.4.3. | Fubini’s Theorem for non-negative Functions 128 

14.4.4. Fubini’s Theorem for Arbitrary Functions 128 


14.5. Comparison between Riemann’s and Lebesgue’s Integrals 129 


5.1. Integrable Functions 129 
2 Lebesgue’s and Fubini’s Theorems 130 
5.3. ‘Transformation Formulas 130 


14.6. L, Spaces 13 


14.6.1. Definition 131 

14.6.2. | Hélder’s and Minkowski’s Inequalities 132 
14.6.3. The Spaces L,(X, B, 4) 132 

14.6.4. The Spaces L,(R,) and L,(Q) 133 


Contents XVII 


1d. Complex Function Theory 134 

165.1. Holomorphic Funetions 134 

15.1.1. The Complex Number Plane C 134 

15.1.2. Holomorphic Functions 135 

15.1.3. | Examples of Holomorphic Functions 136 
15.1.4. The Cauchy-Riemann Differential Equations, Harmonic Functions 137 
16.2. Integral Theorems 138 

15.2.1. Complex Curvilinear Integrals 138 

11) Cauchy's Integral Theorem 140 

15.2.3. | Cauchy’s Integral Formula 142 

16.3. Properties of Holomorphic Functions 143 
ADpcvol! Differentiability and Derivative Formulas 143 
WG) S52 Taylor Series 144 

15.3.3 The Identity Theorem 145 

15.3.4 The Maximum Principle 145 

15.3.5 Liouville’s Theorem 145 

15.3.6 The Fundamental Theorem of Algebra 146 


15.4. Theory of Singularities 146 
15.4.1. Laurent’s Series 146 
15.4.2. Singularities 146 


15.4.3. Systematic Complex Function Theory, Rational Functions 147 


15.5. Theory of Residues 148 


15.5.1 Cauchy’s Residue Theorem 148 

Is) The Logarithmic Residue 149 

15.5.3. | Mapping Properties of Holomorphic Functions 150 
15.5.4 Inverse Functions 151 


15.6. Holomorphic Continuation 151 


1S. The Disk-Chain Method 151 
6 


15.6.2. The Monodromy Theorem 152 
15.6.3 Riemann Surfaces 153 

15.7. Conformal Mappings 154 

15.7.1 Fundamental Properties 154 
15.7.2. Riemann’s Mapping Theorem 155 


15.8. Linear Transformations 156 


15.8.1. Conformal Mappings of C and C, 156 
15.8.2. The Group of Linear Transformations 156 
15.8.3. Invariance of Circles 156 


2 Triebel, Math. Physics 


XVIII Contents 


16.2.1. 
16.2.2. 
16.2.3. 
16.2.4. 


“Mapping Properties and Cross Ratios 157 


Fixed Points and Types of Mappings 157 
Schwarz’s Reflection Principle 159 
Conformal Mappings of the Unit Disk 159 


Special Functions 159 


The Functions e* and Inz 159 
The Functions sin z, cos z, tan 2, and cot z 160 
Partial Fraction Decomposition for cot z 161 


Principles of Hydrodyuamies of Plane Flows 162 


The Fundamental Equations of Hydrodynamics 162 


Preliminary Remarks on Modeling 162 
Source-Free and Circulation-Free Flows 162 
Real and Complex Fundamental Equations 163 
The Mathematical Model 165 


Flow Regimes 166 


Stagnation Point Flow and Multipole Current Flow 166 

Swirl Flow Regimes 167 

Profile Flow Regimes 167 

Conformal Mappings in Hydrodynamics, Zukovskij Profiles 169 


Elements of Geometry 171 


Geometry of Space Curves in Ry; 171 


The Moving Trihedral 171 

Frenet’s Formulae 172 

Plane Curves 173 

Existence and Uniqueness Theorem 174 


Hyperbolic Geometry 174 


Principles of Axiomatic Geometries 174 
A Model of Hyperbolic Geometry 175 
Distances, Angles, and Triangles 176 
Circles 177 

Are Length and Area 177 

Area of Triangles 178 


Geometry of the Hilbert Space 178 


Hilbert Spaces 178 

Examples of Hilbert Spaces 179 
Orthogonal Systems 180 

Schmidt’s Orthogonalization Process 182 
Orthogonal Decompositions 182 


Contents XIX 


Orthogonal Series 184 
n-Dimensional Trigonometrie Functions 184 


Orthonormal Systems 184 

Fourier Coefficients and Absolute Convergence 185 
Periodic Trigonometric Series in L.(Q) 186 
Semiperiodice Trigonometric Series in [,(Q) 186 
Example 187 


Orthogonal Polynomials 188 i 


Approximation Theorems 188 
Legrendre’s Polynomials 189 


Partial Differential Equations 189 
Types of Partial Differential Equations and Physical Examples 189 


Types 189 
Physical Examples 190 


The Laplace-Poisson Equation 191 


Fundamental Solutions and Integral Representations 191 
Green Functions 193 

Properties of Harmonic Functions 194 

Dirichlet’s Boundary Value Problem 195 

The Poisson Equation 195 


The Wave Equation 196 


Uniqueness Theorems 196 

The Wave Equation in One Dimension 198 

Initial Value Problems for the Wave Equation in Two and Three Dimensions 199 
Physical Interpretations, Huyghenian Property, Spherical Waves 200 

The Inhomogeneous Wave Equation, Retarded Potentials 202 


The Heat Conduction Equation 202 


The Singularity Solution 202 
The Maximum-Minimum Principle 203 
The Initial Value Problem 203 


Separation Setups 204 


Introductory Note 204 

The Fixed Loaded Plate 205 

The Separation Setup for the Laplace Equation 206 

The Fourier Method for the Wave Equation 207 

Vibrating Membrane, Vibrating String 208 

The Fourier Method for the Heat Conduction Equation 209 


Contents 
Operators in Banach Spaces 210 
Banach Spaces 210 


Separable Banach Spaces 210 

Special Sets in Banach Spaces 210 
The Space C(Q) 211 
Finite-Dimensional Banach Spaces 211 
Jompletion of Normed Spaces 212 


Operators 212 


Fundamental Concepts 212 
The Space L(.B,, By) 213 
Spectrum and Resolvents 214 
The Space (J,)’ 215 

Integral Operators 215 


Operators in Hilbert Spaces 216 
Classes of Continuous Operators 216 


Isomorphy of Hilbert Spaces 216 
Linear Functionals 216 

Bilinear Forms 216 

Adjoint Operators 217 

Projection Operators 217 

Isometric and Unitary Operators 218 
Compact and Degenerate Operators 218 


The Theory of Riesz and Schauder 219 
Formulation of the Problem 219 
Decomposition Theorems 219 

The Spectrum of Compact Operators 220 


Fredholm’s Integral Equations 220 


The Adjoint Integral Operator 220 
Fredholm’s Alternatives 221 


Distributions 222 
Fundamental Concepts 222 


Introduction 222 

The Spaces D(Q) and D’(Q) 223 
Examples of Distributions 223 
Operations with Distributions 224 
The Space #’(Q) 226 


aimee de 


a a a 


Contents 


XXI 


The Fonvier Transform and the Spaces S(R,) and S’URn) 227 


The Space S(R,) and the Fouricr Transform 227 
Properties of the Fourier Transform 227 

The Space 8’(R,) 228 

The Fourier Transform in S8’(R,,) 229 

Other Properties of Fourier Transforms 230 


Tensor Products and Convolutions 230 


Tensor Products 230 

Properties of Tensor Products 231 
Convolutions 231 

Properties of Convolutions 232 


Partial Differential Equations and Distributions 233 


Fundamental Solutions 233 


Basic Properties 233 

The Laplace Equation 234 

The Heat Conduction Equation 234 
The Wave Equation 235 


Initial Value Problems 236 


Formulation of the Problem 236 
The Wave Equation 237 
The Heat Conduction Equation 238 


Fundamental Concepts of Classical Field Theory 239 


+ 


Tensors 239 


Introductory Remark 239 
The Fundamental Tensor 240 
Tensors 242 

Properties of Tensors 244 
Metric Geodesics 244 


Classical Field Theory 246 


The Model of Field Theory 246 
Lagrange Densities 246 
Lagrangian Formalism 249 


Examples of Field Theories 251 
Covariant Point Mechanics 251 


The Maxwell-Lorentz Equations of Electrodynamics 252 
Interpretation and Transformation of Maxwell’s Equations 253 


XXII Contents 


25. Principles of Special Relativity and Electrodynamics 256 
25.1. Lorentz Group and Space-Time 256 
25.1.1. Minkowskian Space and Inertial Systems 256 


25.1.2. World Lines 258 

25.1.3. The Lorentz Group 259 

25.1.4. Special Transformations of the Proper Lorentz Group 260 
25.1.5. Space-Time (Physical Aspects) 261 

25.1.6. | Space-Time (Mathematical Aspects) 263 


25.2. Effects of Special Relativity 264 


25.2.1. Time Dilatation and the Twin Paradox 264 

De Lorentz Contraction 267 

25.2.3. The Relativistic Addition Theorem of Velocities 267 
25.2.4. The Free Relativistic Particle 268 

25.2.5. Proper Time, Mass, and Energy 268 


25.38. The Maxwell Equations 269 


DH BMoll Formulation of the Problems 269 
DE Bie Tnitial Value Problems 270 


26. Self-Adjoint Operators in the Hilbert Space 271 
26.1. Unbounded Operators 271 


26.1.1. Closed Operators 271 

26.1.2. Closable Operators 272 

26.1.3. Adjoint Operators 272 

26.1.4. Symmetric and Self-Adjoint Operators 273 
26.1.5. Criteria for the Self-Adjointness of Operators 273 


26.2. The Spectrum of Self-Adjoint Operators 274 
26.2.1. The Spectra D4 and Cy 274 


26.2.2. The Spectra D4, and Cy 275 
26.2.3. Compact Self-Adjoint Operators 276 


26.3. Spectral Families 277 


26.3.1. Definitions 277 
26.3.2. Properties 277 


26.4. Spectral Operators 278 


26.4.1, Riemann-Stieltjes Integrals for Functions 278 

26.4.2. Riemann-Stieltjes Integrals for Spectral Fam ilies on Finite Intervals 279 
26.4.3. Riemann-Stieltjes Integrals for Spectral Families on R, 280 

26.4.4. Spectral Operators 280 


Contents XXIII 


26.4.5. 
26.4.6. 
26.4.7. 


Lo 
-l 


1g 
-1 
: 
fwd 
8 


1 sg +] 
— ee 
Rm oS roe 


to te to te te 
-3 +I 


i) 


is 
=a 


Le) 


~1 


=~] 


bo bo bo 
a | 

WwW te 

wee 


NW bo le 
© co 


19 
Do 
se 


The Fundamental Theorem of Spectral Theory 281 
The Spectrum of Self-Adjoint Operators 281 
Operators with a Pure Point Spectrum 282 


Differential Operators and Orthogonal Funetions 283 


Classical Orthogonal Functions 283 


Introductory Remark 283 
Trigonometric Functions 283 
Hermite’s Functions 284 
Legendre’s Functions 285 
Laguerre’s Functions 285 


Surface Harmonics 286 
Beltrami’s Differential Operator 286 


Surface Harmonies as Eigenfunctions 288 
Three-Dimensional Surface Harmonics 289 


Principles of Quantum Mechanics 289 
Axiomatices of Quantum Mechanics 289 

The Hilbert Space Model 289 

The Dynamics of Quantum Mechanical Systems 290 
Stationary States 291 

Interpretations 291 

Bohr’s Postulate 291 

Statistical Interpretation of Quantum Mechanics 292 
Heisenberg’s Uncertainty Principle 293 


Quantization 294 


The Quantization Rule 294 
Examples of Quantization 295 


Single-Particle Problem 297 


~ One-Dimensional Motion of a Free Particle 297 
The Harmonic Oscillator 298 
The Relativistic Free Particle in f, 299 


The Hydrogen Atom 301 


The Hydrogen Atom without Spin 301 
The Zeeman Effect 303 

The Hydrogen Atom with Spin 304 
The Relativistic Hydrogen Atom 307 


XXIV Contents 


28.6. 


28.6.1. 


28.6.2. 
28.6.3. 
28.6.4. 
28.6.5. 


30.1. 


30.1.1. 


Atoms and the Periodic System of the Elements 308 


Atoms without Spin 308 

The Space L3, 4( Ran) 308 

Atoms with Spin 310 

The Pauli Principle 311 

Periodic System of the Elements 313 


Geometry on Manifolds I (Tensors) 314 
Manifolds 314 


The Paracompact Hausdorff Space 314 
C=-Manifolds 315 
Functions on C~-Manifolds 316 


Geometric Objects 317 


Fibre Bundles 317 
Tensor Densities 318 


Tensor Analysis 319 


Fundamental Operations for Tensor Densities 319 
Differential Operations 320 
Integrals on Manifolds 320 


Affine Spaces 321 


Affine Transformations 321 
Normal Co-ordinates 321 
Covariant Differentiation 322 
Translation 322 

Affine Goedesics 323 
Riemann’s Tensor 324 

Flat Affine Spaces 325 


Metric Spaces 325 


Fundamental Tensor 325 

Index Shifting 326 

Characteristic Surfaces 327 

Metric Geodesics 328 

Geodesically Convex Domains 329 

Metric Spaces 329 

Riemann’s Tensor and Related Tensors 330 


General Theory of Relativity | (Fundamental Equations) 331 
Variational Principles 331 


Lagrangian Formalism 331 


Contents XXV 


Kinstein’s Equations 331 
Einstein- Maxwell Field Equations 332 
Some Remarks Einstein Made on Relativity and Quantum Theory 334 


The Energy-Momentum Tensor 336 


Killing Vectors and Laws of Conservation 336 

The Covariance Principle 337 

Energy-Momentum Tensor for Ideal Liquids 338 
Coinparison with Newton’s Theory of Gravitation 338 


Equations of Motion 339 


Test Particles and Electromagnetic Waves 339 
Proper Time and Twin Paradox 340 


Sehwarzschild’s Solution 341 


The Birkhoff Theorem 341 
Eddington’s Form of the Schwarzschild Solution 342 


The Classical Effects of the General Theory of Relativity 344 


Planetary Motion 344 
Deflection of Light 346 
Red Shift in the Gravitational Field 347 


General Theory of Relativity Hf (Singularities, Black Holes, 
Cosmology) 348 


Singular Manifolds 348 


Criteria 348 

The Schwarzschild-Eddington-Kruskal Metric 350 
Closed Trapped Surfaces 352 

Singularities 353 

Black Holes 354 


Theory of Black Holes, Evolution of Stars 355 


The Eddington Metric 355 

Stars 357 

The Hertzsprung-Russell Diagram and the Celestial Scale 358 
The Kerr Metric 360 

Energy Balance of Black Holes 362 


Cosmology 363 
Principles 363 


The Robertson-Walker Metric 364 
The Dust Universe 364 


XXXVI Contents 


31.3.4. Hubble’s Law 365 

31.3.5. Solutions of Friedman’s Equation 366 
31.3.6. | Friedman’s Models 367 

31.3.7. The Big Bang 367 

31.3.8. Birth of Life in the Universe 368 


32. Geometry on Manifolds IT (Forms) 370 


32.1. Tensors and Differential Forms 370 


0 
32.1.1. The Vectors aa and dx*. Tensor Products 370 


32.1.2. The Alternating Product and the Exterior Product 371 
eles Exterior Derivative 372 

32.1.4.  n-Forms 373 

32.1.5. Theorem of Poincaré 373 


32.2. Integral Calculus on Manifolds 374 


32.2.1. Integrals of n-Forms 374 ; 
32.2.2. The de Rham Operator 374 
32.2.3. The Stokes Theorem 375 
32.2.4. Leray Forms 376 


32.3. Distributions on Manifolds 377 


32.3.1. Scalar Distributions 377 

32.3.2. Tensor Distributions 379 

32.3.3. Covariant Derivative and Coderivative of Distributions 380 
32.3.4. The Wave Operator 381 

32.3.5. Distributions of Type f(S) 381 


33. The Wave Equation on Curved Space-Times 382 
33.1. Characteristic Surfaces and Singularities 382 


33.1.1. Characteristic Surfaces 382 

33.1.2. Initial Value Problems for Characteristic Surfaces and Null Fields 383 
33.1.3. Caustic 385 

33.1.4. The Caustic in the Minkowskian Space 386 

33.1.5. Discontinuities of Solutions of the Wave Equation; Catastrophes 387 


33.2. Fundamental Solutions 388 


33.2.1. The Problem 388 

33.2.2. Causal Domains 389 

33.2.3. The Distribution 6,,(/’) 390 
33.2.4. Fundamental Solutions 391 


Contents XXVIT 


33.3. 


33.3.1. 
33.3.2. 
33.3.3. 
33.3.4. 


33.4. 


33.4.1. 
33.4.2, 
33.4.3. 


33.5. 


33.5.1. 
SoeOece 
33.5.3. 
33.5.4. 


34. 


34.1. 


34.1.1. 
34.1.2. 
34.1.3. 


34.2. 


34.2.1, 
34.2.2. 
34.2.3. 


34.3. 


34.3.1. 
34.3.2. 


34.4. 


34.4.1. 
34.4.2. 
34.4.3. 


34.5. 


34.5.1. 
34.5.2. 
34.5.3. 


Solutions of Pu=f, Cauchy Problems 392 


Past-Compact Sets and Distributions 392 

An Existence and Uniqueness Theorem 393 

The Cauchy Problem: Existence and Uniqueness 394 
The Cauchy Problem: Representation 394 


Tensor Wave Equations 396 


Definitions 396 
Fundamental Solutions 397 
Solutions of Pu=f 398 


The Maxwell Equations 399 


Definition 399 

Continuity Equation and Cauchy Data 399 

Gauge Condition and Four-Potential 401 

The Cauchy Problem for the Maxwell Equations 402 


Singularity Theory 403 


Loeal Mappings 403 

Germs of Mappings, the Ideal m(n) 403 
Finitely Determined Germs 404 

Criteria for Finitely Determined Germs 404 
Stability 405 

Definitions 405 

Immersions and Submersions 407 

Global Theorems 408 


Singularities and Morse Functions 409 


Singularities 409 - 
Morse Functions 410 


Mappings in the Plane 411 

Good and Excellent Mappings 411 

Normal Forms of Fold Points and Cusp Points 412 
Whitney’s Theory 413 

Unfoldings 413 

Definition 413 


Associated and Equivalent Unfoldings 414 
Stable and Universal Unfoldings (Definition and Examples) 414 


XXVIII Contents 


34.5.4. Stable and Universal Unfoldings (Criteria) 415 
34.5.5. Reduction of Unfoldings 415 

34.5.6. Minima 416 

34.5.7. Thom’s Theorem 417 


35. Catastrophes: Theory aud Application 418 


35.1. Principles and Models 418 


35.1.1. General Principles and Fundamental Tdeas 418 
35.1.2. The Local Regime 421 

35.1.3. Examples of Application 423 

35.1.4. The Three Interpretations of Catastrophe Theory 424 


35.2. Elementary Catastrophes 425 


35.2.1 The Generic Aspect 425 
35.2.2. Pictures of Elementary Catastrophes 425 
35.3. Applications in Physics 429 


35.3.1. The Van der Waals Equation 429 
35.3.2. Eulerian Deformations 431 
35.3.3. Breaking of Watcr Waves 433 
35.3.4. Catastrophe Machines 434 

35.4. Other Applications 436 

35.4.1. Taylor Series and Cells 436 


35.4.2. | Applications in Biology 436 
35.4.3. Dogs and Mathematicians 437 


Appendix: On the Relation between Geometry and Reality during Time’s 
Changes 439 


References 445 3 
Hints for the Use of the References 448 


Index 449 


This must ye ken! From five and six — 
From one make ten, In that the trick’s — 
Drop two, and then Make seven and eight, 
Make three square, whieh And all is straight; 
Will make you rich; And nine is one, 
Skip o’er the four! And ten is none. 
This is the witeh’s One 
Time’s One! 


_ (Witches? Kitehen, Gocthe’s Faust, I, I, in the 
Sir Theodore Martin Translation, London, J. M. 
Dent & Sons Ltd., 1954) 


ile Numbers and Spaces 


Ll. Real Numbers 


1.1.1. Number Systems 


The real numbers are familiar from school. Here are some concepts we will 
need continually in what follows. 


Natural numbers: 1, 2, 3, ... 
Integers: 0, 1, —1, 2, —2, 3, —3,... 
P n : 
Rational numbers: 7=—, where x and m are integers, m+ 0. 


m 


Real numbers: «= +7yn... NeNE 41M 42+.) Where n; can assume the values 
0, 1, 2, ..., 9 (decimal representation of real numbers). 

The fundamental operations of addition, subtraction, multiplication and 
division as well as the order relations <, =, >, 2 are supposed to be known. 


Powers: Let « be a real number, then we write 


a2=g:a, a=at-a, ..., af=al fsa, 1. (f=2,3,...). 


If, moreover, «+0, we may also form the powers 
1 


g-l==. 
x 


a ee ee ei (G2 3 ele 


It is expedient to set «°=1. Then one obtains a! + a*=aJ** (where 7 and & are 
integers). If one of the two integers j or k is negative, one must additionally 
require that «+0. 


Absolute value: For any real number « we define 
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Numerical axis: As is known from school, the real numbers ‘« can be identified 
with the points P of a straight line (of infinite extension to both sides) (Fig. 1.1). 
This is not, however, as unproblematic as it may appear at a first glance. This 
will be discussed in greater detail in the next subsection. 


ai Fig. 1.1 
a 0 loel 


1.1.2. Distance aud Axiom of Completeness 


Definition 1. Let « and B be real nwmbers, then o(a, 8) =|«—]l ts the distance be- 
tween these numbers. 


Lemma. The distance between real numbers has the following properties: 
1. o(a, B)=0, where o(a,f8)=0 if and only if «=f. 
2. ola, B) = 0(f, a) ° 
3. O(a, B) Sola, 7) +07, &) - 
Notation: Open interval: (a, b)={a|« real, a<«<b}, 
half-open interval: [a, b)={« | areal, asa<b}, 
half-open interval: (a, b] = {« | « real, a<«=b}, 
closed interval: [a, b]={« | « real, a=a=b}. 
a€(a,b) means a<xa<b, with analogous interpretations for the other types of 
intervals; ‘“€”’ means “is an element of”. 


(a, B, y real) . 


Definition 2. Let M be any set of real numbers. The real number « ts called accumu- 
lation point of M if every open interval (a, b) with «€ (a, b) contains infinitely many 
points of M. 


Examples: 1. If M = {a;}721, with ey=a;=a,=...=0 and a, =a,=ag=...=1, then 
M has two accuniulation points, i.e. 0 and 1. 


il 
opltey = {a,)7-)-with ¢= j , the 0 is the only accumulation point of M. 


3. If M=(0, 1), then the real number « is an accumulation point of M if and 
only if «€[0, 1). 


Remark 1. At the end of the preceding subsection we had pointed out that the correspond- 
ence between real numbers « and points of the numerical axis requires further explanation. 
Speaking descriptively, the problem is whether the real numbers fill the numerical axis 
without any gap. We simply forbid the occurrence of gaps by the following 


Axiom: Every bounded infinite set M of real numbers has at least one accumulation 
point. 


Remark 2. A set .V of real numbers is said to be bounded if there exists a non-negative num- 
ber ¢ such that |«|=c for every «¢€ M. 


Remark 3. Our point of view is as follows. The real numbers and the arithmetic operations 
applicable to them are supposed to be known. The above considerations are not thought to 
replace this knowledge. On that basis we will introduce complex numbers in a mathemati- 
cally rigorous manner in the next section. It is possible to begin with the natural numbers 
instead of the real ones. In this case one has to introduce the integers, the rational and, 
finally, the real numbers step by step by an axiomatic method. This allows of many com- 
ments. We shall not take, however, this time-consuming way. 
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1. Complex Numbers 
1.2.1. Definition 

Let a and 6 be real numbers, then the ordered pair z= {a, b} is called a com- 


plex number. z can be graphically represented in a coordinate system (Fig. 1.2). 
Addition and multiplication are defined as follows. 


Definition. Let ay, ao, by and by be real numbers. 
{a 4, by} + {ag, by} = {a + Ag, by + by} (Addition); 
{ay by} + {ay, bo} = {a yay —bybo, ayby +.49b,} (Multiplikation). 


Remark. This reduces the addition and multiplication of complex numbers to the addition 
and multiplication of real numbers. 


1.2.2. Properties 


Theorem. Let z; be complex numbers. Then 


Zytegeeyte, (commutative law of addition) ; 

Zi =o 24 (commutative law of multiplication) ; 
(zy +2) +2,=2;+ (+23) (associative law of addition); 
(Cl epee — 2) (ea 25) (associative law of multiplication) ; 


(z1 +29) * 24= 21° 23 4+29° 23 (distributive law). 
Remark 1. The operation put in parentheses must be carried out first. 


Notation. From now on we will write z,z) instead of z,- z, as well as z;+2)+ 2 
instead of 2, + (2) +2) and z,z9z, instead of z4(z23). The theorem shows that it ts 
irrelevant in which order the operations in z,+2) +2; are carried out. Further it 
is seen that, using the above theorem, it is possible to remove the parentheses 
from any kind of expression in parentheses, e.g. 
(24+ 2923) (24 +25) = 24 (2 + 25) + 2223 (24 +25) 
= 224 1 2254 ZZ q2q + y% es - 

Remark 2. Consequently, with respect to addition and multiplication complex numbers can 
be treated in the same way as real numbers, including raising to powers, 


2, =e, Bo OR ong gi=zi-'z, sigs G2 3, or) r 


1.2.3. Conjugate Elements, Subtraction and Division 


Definition. Let z= {a, b} be a complex number. Then Z= {a, —b} (conjugate cle- 
ment or conjugate number of z) and —z={—a, —)} (Fig. 1.3). 
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Notation (subtraction). We write 
2,;—% =2,;+(—z2). One has 2,—2,.= —%1+2,- 

Lemma. Let z= {a, b} be a complex number and ¢ a real number. Then 
2+ {0,0}=]2, 2°41, OF =41, U2 ee ic crac 


Remark 1. {0, 0} is the neutral element of addition, and {1, 0} is the neutral element of 
multiplication. 


Notation. Let c be real and z complex, then we write cz instead of {c, 0} z. The 
real numbers c are identified with the special complex numbers {e, 0} and will no 
longer be distinguished from now on. By Def. 1.2.1, this is justified in view of the 
calculating rules ' 


{c,, 0} + {eo, 0} = {ey +e, 0},  {e), O}{eg, 0} = {eyeo, 0}. 


Note that all other operations with complex numbers (and hence especially with 
the special complex numbers {c, 0}) are introduced step by step, starting from 
Def. 1.2.1. Thus we have extended the number domain available to us from the 
real to the complex numbers. 


Theorem. (a) Jf z, and 2, are given complex numbers, then the equation 24+ w=zZs 
has exactly one solution. It ts w= 2 — 24. 

(b) If z is complex, then 22 is real. If z+ {0, 0}, then 2z>0. 1 

(e) Lf 2+ {0, 0}, then z-w={1, 0} has the unique solution w=—-2 (where 


rn 
QI 


— is real). 


¥ Le 

(d) If z,+ (0, 0}, then zyw=zy has the unique solution w=— 2 jz. 
Hee 

Remark 2. Following the above conventions, from now on we will write 0 instead of {0,0}, 

and hence also z +0 instead of z+ {0, 0}; by analogy we write zw = 1 instead of z- w={1, 0} 

Remark 3. Using the representation z={a, b}, the formulas given in the theorem can be 

expressed by ordered pairs of real numbers. For example. 


22 ={a, b} {a, —b} = {a24+ b2, O}=a2+b2. 


Remark 4. The theorem enables complex numbers to be subtracted and divided. 2~!=— has 


a; 
: ; : ys 
a well-defined meaning as the solution of (z7!) z=1 (where +0); by analogy — is the 
at 
solution of 242 =z with z,+0. Thus we can-also obtain any powers 271, z-2=(27!)2, 2-3= 
=2-*2~!, provided that z+0. Then we have extended the four basic arithmetic operations 
from the real to the complex numbers. As was already pointed out, from now on we will 
regard the real numbers as special complex numbers. 


1.2.4. Normal Representation 


Definition. Let z= {a, b}, then a= Re z is called the real part and b =m z is called 
the imaginary part of z (Fig. 1.4). The complex number i= {0,1} is called the 
imaginary unit. 
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Lennna. Normal representation: z= Re z+ (inie2 
Example. Let z= {a, b} £0, then 
Re ig ae ihe gle = 
2 
The number i. The following relation holds for i: 
2 —11— (0, 1} (0; 1}={= 1, 0}= —1, 


Consequently, in this sense i is the square root of —1. 


1.3. R,, Cy, and Metrie Spaces 
13.1. The n-Dimensional Real Space R,, 


Definition 1. Let n be a natural number. Then Ry is the set of ordered n-tuples 


(a4, +++) An) of real numbers aj. If «=(ay, .., An) and B=(b4, ..., bp) are two elements 
of Ry, then 
n 


0(2, ae |atj— 6; 
c= 
ws the distance between these elements. 
Remark 1. The elements of Ry are also called points. 
Theorem 1. Let a, 8 and y be points of Rn. Then 
1. o(z, B)2=0, where o(a,B)=0 tf and only tf Le 
2. o(a, B)=o(B, a) ; 
3. o(x, B)=o(a, y)+o(y, B)- 


Remark 2. This theorem is a generalization of Lemma 1.1.2. In particular it is seen that Ry 
is the numerical axis deseribed in Subsee. 1.1.1. and 1.1.2. 


Definition 2. (a) Let e=+0 and a€ Ry, then 
K= {6 | BE Rn, a(6, a)<e} 


ts called an (open) ball (with centre a and radius e). 

(b) Let M bea set of points in Ry, te. MC Ry. Then a€ Ry is culled ace amulation 
point of M if every ball with centre a contains infinitely many points of M. 

(¢) Mc Ry ts said to be bounded /f there exist « positive number K and a BER, 
such that o(a, B)=K for allacM. 


Remark 3. If the property (e) holds for a fixed BE Ry, then it also holds for every other ele- 
ment of Ry. This follows from property 3 in Theorem 1. Conseqnently, whether a set Mc Ly 
is bounded does not depend on the choice of /. 
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Theorem 2. Every bounded infinite set of points of Ry has at least one accumulation 
potnt. 


Remark 4. The property of being an infinite point set does not necessarily mean that the set 


must consist of infinitely many different points. Thus for instance M = {o;}7-1, where «j= 
=(0, ..., 0), is an infinite point set. Its only accumulation point is (0, ..., 0). 


Remark 5. In the case n=1 Theorem 2 coincides with the axiom of Subsec. 1.1.2. 


1.3.2. The n-Dimensional Complex Space C, 


Definition. (a) The set of complex numbers is denoted by C, (complex plane). Ty 
24€ Cy and 2 €C, then 


o(z1, 2) =|Re z,— Re z|+/Im z,—Im 2] 


is the distance between the elements z, and 29 (Fig. 1.5). 


0(2;,2y) 
NY 
Z; 


2 Fig. 1.5 


(b) Let n be a natural number, then Cp is the set of n-tuples (24, ..., Zn) of complex 
numbers z;. Let a=(z,, ., 2n) and B= (wy, «5 Wa) be any two elements of Cn, then 


Rn 
o(2, ) = 2 o(zj, uy) 
ih pana t 
75 the distance between these elements. 


(e) If in Definition 1.3.1/2 the space Ry vs replaced by Cp, one obtains the defini- 
tion for (open) balls, accumulation points, and bounded sets in Cp. 


Theorem 1. If one sets (z, ..., Zn) =(Re z,, Im 2, ..., Re zn, Jim zp), then Cy = Ran. 


Remark. Consequently, the n-dimensional complex space Cy may also be regarded as the 
2n-dimensional real space Roy. 


Theorem 2. Theorems 1.3.1/land 1.3.1/2 remain valid tf Cy ts substituted for Rp. 


1.3.3. The Metric Space 


Definition 1.4 metric space consists of a set M and a real-valued function which 
assigns to every pair a€M and bE M a non-negative number o(a,b)=0 with the 
properties 


(o(G, 6)=0, where ola, b)=0 4) and onl —e 
2. ola, b) =0(b, a) > 
3. o(a, b)=o(a,c)+o(c,b) forall ceM. 


0 ts called a metric. 
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Remark 1. 2, and C,, are special metric spaces. This follows from Theorem 1.3.1/1 and Theo- 
rem 1.3.2/2. Later on we shall give many other examples of metric spaces, which in part ex- 
hibit properties quite different from those of &, and C,,. 


Definition 2. (a) Jf in Definition 1.3.1/2 the space Ry is replaced by an arbitrary 
metric space M, then one obtains the definition of (open) balls, accumulation points, 
and bounded sets in M. 


(b) A subset A of the metric space M ts said to be closed if K contains all the 
accumulation points of K. 


Remark 2. If J/ = R,, then K={0} and K=[0, 1] are closed, whereas K =(0, 1) or A =(0, 1] 
are not closed. 


Lemma. Every subset K of a metric space M is uself a metric space tf the same 
metric as for M is applied to K. 


2 Convergence and Continuity 


UAL Sequences 


2.1.1. Infimum, Supremum, and Limit 


Definition 1. Let {a;};21c Ry be a sequence of real numbers. 

(a) a€ Ry 2 called infimum of the sequence {a;} tf a is the greatest real number 
such thal d=a,; forg—1, 2,3, .., 

(b) a€ Ry ts called supremum of the sequence {a;} if a ts the smallest real number 
such that a2a; for j=1, 2, 3, ... 


Lemna 1. (a) Every bounded below sequence {a;};=1 c Ry (a.e., for which there exists 
a real number M such that M <a; for j=1, 2, ...) has exactly one infimum. (Fig. 
SE) Nie 

(b) Avery bounded above sequence {a;}7_,c R, (i.e., for which there exists a real 
number M such that M >a; for j=1, 2, ...) has exactly one supremum (Fig. 2.1 (b)). 


u iW y Fig. 2.4 
Cp ii HE ep 
0) po 


Notation. inf a; denotes the infimum (if there is any) of a sequence {a;}, and 
4 . . . 
sup a; denotes the supremum (if there is any). As there is at most one infimum 
j . . ° 
and at most one supremum, the notation is meaningful. 


Exainples. The sequence 1, 0, —1, 1,0, —1, 1,0, —1,... has the infimum —1 
and the supremum 1. 
3* 
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De 


: i: : 
2. For fa}? with aj=1+-,, infa;=1 and sup a;=2. This example shows 
J } 


that the infimum (and, by analogy, the supremum too) need not necessarily be 
an element of the sequence. 

Definition 2. (a) A sequence {a;}7.1< Ry ts satd to converge if there exists a number 
a€ R, with the following property: for any positive number e there exists a natural 
number jo(e) such that|a—aj\ Se for all j7=j,(e) (Pig. 2.2). 


eee Fie. 2. 
tt g. 2.2 

| a | 

| 2E t 


(b) A sequence {z;}<C, is said to converge if there exists a number z€C, with 
the following property: for any positive number ¢ there exists a natural number 
joe) such that o(z;, z) Se for all 7=jo(e) (Fig. 2.3). 


Fig. 2.3 


Notation. For convergent sequences we write 


G=UM~ Or ajo 


joo 


Peslias (i eyoen 


jwe 


This is justified, because the limit points a and z, respectively, are well-deter- 
mined: a convergent sequence has exactly one limit. 


Remark. If a; >a, then {a;};—1 has the unique accumulation point a. An analogous assertion 
is true for 2;—<. 


Examples. 
1 =! 

him (14 5)=1 lim (14 )an. 
j 


jres ) jose 


Lemma 2. (a) A convergent sequence (of elements of R, or Cy) is bounded. 
(b) {z;}7.1 c C, is convergent if and only rf both (Re zj7-1c Ry and (Im 2372, C Ry 
converge. 


2.1.2. Properties of Convergent Sequences 


Tn this subsection we shall consider complex sequences. As real sequences are 
special complex sequences, the results are in particular also true for real se- 
quences. z and w (with or without indices) always denote complex numbers. 


Theorem 1. Let z;>z and w;>w. 
(a) Lf Aand pw are complex numbers, then 


Azj + pwj— dz+ pw, ZjWj> ZW . 
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L Ww 


(b) ff z+0, then 2;+0 for j=jy, and —— (7 = jo)- 
(¢) 5-2. Cy 


Definition. {2;}7., <C, ts called a Cauchy sequence (or fundamental sequence) if for 
any positive number e€ there exists a natural number jo(e) such that 0(2z;, 2x) =e for 
all jand k with 7=jo(e) and k= jo(e). 


Remark. This also includes the definition of real Cauchy sequences te; = Cy) pene 


Theorem 2 (Cauchy's condition). A sequence {2j}7_1 CC, converges if and only if 
it 1s a Cauchy sequence. 


2.1.3. Examples 


Lemma 1 (Bernoull’’s tnequality). Let n be a natural number and a> —1, then 
(l+a)"2=1+4na. Here the sign of equality ts valid ¢f and only if either a=0 or n=1. 


Lemma 2. (a) Let k be an integer and O0<a<1, then 
ae 0 7s aco. 
(b) Let k be an tnteger and a>1, then 
nigh +co as n-0 
(this means that n*a” is greater than any given positive number tf n is chosen suffi- 
crently large). 


k 
Lemma 3. {a,};_, weh a, = (1 ab is a monotone increasing bounded sequence; 
1 


ie 


1 k 1 k4+1 
lim (laa) = lim (1+;] =f, 


i co y oo 


k+l 
(Ope —argeel. Op — (1 ar is a monotone decreasing bounded sequence and 


Remark. The lemma says that 
2=4,<t)<...<ay <p, ,<.. <b =<... <b {aby <...<d) <b, =4. 
Here, for the moment, H=sup t= inf 6, is merely an abbreviation, which will, however, be 


of fundamental interest later on. 


Dee Series 


2.2.1. Convergence and Divergence 


oo 


The series to be discussed here are infinite series of complex numbers, >) 2, 
i 


2z;€C,. If 2;€ R;, one has series of real numbers. The partial sums are denoted by 
N 


Sy= 2 ej 


Ve) 
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Definition. ( oS 2; ws sad to CORTE af the sequence a partial sums, {Sy}Na1s 
2 
OORT GES This is written as lim S ee > Bie Otherwise s 2; ws said to diverge. 
ee =1 j=1 
b) > 2; ws satd to converge Pesca if >, o(z;, 0) converges. 
1S j= 
Remark 1. If z; are real numbers, then o(z;, 0) = |2;|. In this Cane 2 z;is absolutely convergent 
J= 
if and only if 2, |z;| converges. 


Theorem. (a) If 2 2; converges, then 20 as k— co, 


(b) Lf > a eer absolutely, then > 2; converges. 
i= =I 


¢) 2 2; converges tf and only if both S Re z; and > Im z; converge. 


j=l ae 
d) >) 2; converges absolutely if and only tf both > Re z; and > Im 2; are 
) j=l gel 
tooth convergent. 


Remark 2. A sequence {w;}j—1 with w;—-0 as j > is called a null sequence. Part (a) of the 


above theorem means that the terms of a convergent series form a null sequence. The con- 
verse of this is not always true, as we shall see in the next subsection. 


2.2.2. Examples 


= 1 
Lemmal. Jf 0O<q<1, then > ¢& (where q@=1) converges, and > G= j 
(geometric series). ray) j=0 aa 


Lemma 2. (a) The harmonic series > 7 diverges. 
co 1 g=1 
(b) For k=2,3,4,..., 2 | converges. 


j=l) 


Lemma 3. The series 1-~+— 
lutely convergent. 


——+—=-——4+... ts convergent, but not abso- 


1 1 
Remark. The scries 1 =a 0 qa tends oscillatingly towards its limit (Fig. 2.4), whereas 
1 
iar as 3 +... Increases indefinitely. 
Jae 


24 


oe Fig. 2.4 
0 i 7 
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2.2.3. Criteria of Convergence 


Theorem 1 (majorant criterion). Let 0Sa;=b;. If 2 b; converges, then > a; 
converges, too. ee 


9 ¢ : : : 
Ba 2.2. Series it 


Remark 1. 2 6; is called a majorant of the series 2 a;. 
j=1 j=l 
Theorem 2. Let u;=0. 
(a) (Ratio test). If there exist a number q with O<q< land a natural number jo, 


such that aj+1=qaj; for j= jy, then >) a; converges. 
j=l 
(b) (Root test). If there exist a positive number C, a number q with O<q<1, and a 


natural number jo, such that a;=Cq! for 7= jo, then > aj converges. 
ja 


j 
Remark 2. Later on we shall explain the symbol Va; (root). Then (b) can be rewritten as 
j 


Va;sq=<1 for j7=jo. This explains the designation “root test”’. 


Remark 3. As the terms of the series considered here are nonnegative, convergence and abso- 
lute convergence coincide. ; 


Exaniple. Let 4 be an integer and 0<a<1, then 


(n+1)* a®+! 


Ke 


nan =q=<l 


for large values of x. Consequently, according to the ratio test, > n*u” converges 
(see also Lemma 2.1.3/2). ae 


2.2.4. Rearrangement, Multiplication, and Addition 


A permutation of the natural numbers is a mapping (nm) which assigns to 
every natural number 7 a natural number g(n) such that for any given natural 
number k there is exactly one natural number n(k) with £=¢(n(k)). Hence, in 
particular, 9(n)+@(m) for m+n. Consequently, the numbers 1, 2, 3, ... are only 
rearranged. 

The series presented in this subsection are complex ones, and hence include 
series of real terms as a special case. 


Theorem 1. Let > z, where zj€C,, be absolutely convergent. Further let w;=zqj), 
i=1 S 

where ¢(j) is any permutation of the natural numbers. Then > wu; is absolutely 

ee . j=l 

convergent, and > z= 2 uj. 
jel jal 


1 


Remark 1. The above assertion is false when >, z; is only required to be convergent (but not 
j=l 
absolutely convergent). This can easily be understood by considering the series J. 


its ah al 


Aaa = 
34°65 
Theorem 2 (major rearrangement theorem). Let > z%, where zjE Cy, be absolutely 
j= 
convergent. Let there exist a one-to-one correspondence between the terms wet and z;, 
where each of the indices k, land j runs through the natural numbers: WEL = Z(k,l)- 
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(a) Then >) wz. ts absolutely convergent for every number k=1, 2, 3, . 
v1 


(b) Let u,= 2 Wily then Ds uz ts absolutely convergent, and Lo p2 Wie 
= 


Remark 2. The sequence ae =; is thus decomposed into infinitely many subsequences 
{wr i=1, where k=1,2,3.... Then these subsequences are summed up separately (part (a)), 
and thereafter the sums obtained in this way are added (part (b)). The theorem remains val- 
id when & assumes only finitely many values, i.e., when {z;} is decomposed into a finite num- 
ber of subsequences. 


Theorem 3 (multiplication theorem). Let the complex series S 2; and 3 wy be abso- 
il a 


lutely convergent. Further let there exist a one-to-one correspondence between the 
terms zw, and the terms uw, where each of the indices j,k and l runs through the 


natural numbers. Then >) wu, ts absolutely convergent, and 
bat . 


(2, s) (2, ee ~ - oa 


Remark 3, What is meant is the following: find all the products z;w, and arrange them in 
some way or other, €.g., 2;w,, 2904, 242, Z3Wy, ZW, %Wy, 2524, -.-, and then denote these 
terms by uy, tg, Us, «.- 


Example. Let 0<q<1. Then it oo from the above theorem that 
(3) (Za)=3 0-00 
k=0 =0 
According to Lemma 2.2.2/1 this gives 
a : 1 
Cd ee: 
ey Eg? 
This example shows that the theorem is also useful in calculating the sums of 


infinite series. 


Theorem 4 (addition theorem). Let the complex series S ajand 2 wy be convergent. 
oo g=1 
Tf Aand u are two complex numbers, then >, (Azj + uw) en and 
i 


2 art m)=a Datu Dm 
— i i 


2s Real Funetions in Ry; 
2.3.1. Definitions 


Let M be a subset of R,, and M +0 (empty set). 


Definition 1. A mapping which assigns to every point x€ M a single value f(x)é R \ 
ws called a real function f defined on M (Fig. 2.5). One writes M = D(f) (domain of 
definition of f). 


tw 
a) 
bee, 


2.3, Real Functions in R, ile} 
F(x) 
wa Wie -f (x) ax+h 
,a ie 8 
2S ee eee hee, Ba ) 
7 ry = g Fig. 2.6 


Examples, 1. D(f) =[0, 1] and j(x)={) £0" Hee Ce DG) = Ry andi) = 


=ax +6 (Fig. 2.6) (where a and 6 are real numbers), or f(x) =|z| (Wig. 2.7). 
Definition 2. Let D(f)=M be the domain of definition of the function f. 

(a) ae f(x) @nfimum of f) denotes the greatest real number « such that m= fr) fer 
all xe M. Bue f(x) (supremum of f) denotes the smallest real number « such that 

cEM 

f(x)sa for allxeM (Fig. 2.8). 

(b) f ts sard to be bounded 7f there exists a positive number K such that |f(x)| = K 
for all xe M. 


Fig. 2.8 


D(f)=C01] 


Remark 1. Compare with Def. 2.1.1/1. It is clear that a function f will have at most one 
infimum and at most one supremum, which justifies the chosen designation. 


Lemma 1. Jf f 2 bounded on D(f), then f has a unique infimum and a unique 
supremum. ; 


Remark 2. By analogy with Lemma 2.1.1/1, the above Lemma can be extended to functions 
which are only bounded from either below or above. 


Remark 3. If there exists a point x9¢J/ such that f(%»)= inf_ f(x), then / assumes its infi- 
mum, which is then the minimum of /, written as 


H(x9) =. . f(z) = oan f(x) . 


If there is a point 7,¢ such that f(x,)=sup f(x), then / assumes its supremum, which is 


then the maximum of f, written as xEM 
la,) =sup f(2) =max fe) . 
zeM ceEM 


Definition 3. Let f be a function with D(f)= M. 

(a) Let x) €M, then f ts said to be continuous at the point x, tf for any e=0 there 
exists a positive number 6=O(e, xo) such that |f(x)—f(xo)| =e for all cE M with 
ie tol=0 (Hig. 2.0). 


14 2. Convergence and Continuity BBe. 
tO OOOO rr 


(b) f is called continuous on M if f is continuous at every point xy»¢M. 

(ce) f 7s called uniformly continuous on M if f is continuous on M and if, for 
any €>=0, the above number 6 can be so chosen that 6 =4(e) depends on e alone, berng 
independent of rae M. 

Remark 4. Uniform continuity is understood as follows. f is called uniformly continuous on 
M if for anye>+0 there exists a positive number 6=0(e) such that |f(x) —f(y)| se for all 
wé€M and y€ M with |e —y|=6. 

Examples. The function of the first of the above examples is not continuous at 
any point 2)¢€D(f)=[0, 1]. On the other hand the functions of the second ex- 
ample, where D(f)= R,, are continuous. 


Definition 4. f(x) —> a, or f(z) +a as x>29, means that f(xj)>a for any sequence 
inet 21) 

{rj}fi1c D(f) with xj> x. 

Lemma 2. f 7s continuous at the point x) € D(f) tf and only tf f(x) — f(a) as «> x. 


Definition 5. f is said to be continuous on the right at the point x y€D(f) if there 
exists a real number a such that f(x;)-a for any sequence {x;}7_, D(f) with xj> xo 
und xj>xp. f ts said to be continuous on the left at the point x9 € D(f) if there exists a 
real number a such that f(aj)-+a for any sequence {xj}7.1C D(f) with xj;> 2x9 and 
Lj= Xe 

Remark 5. Henceforth we shall write x; | a instead of x; with xj><a 9, and x; tx instead 
of #;>% with x;<a). Lemma 2 shows that Definition 5 is a generalization of Def. 3(a). In 
the case of continuity on the right, the only sequences 2; admitted are those converging to 
vo ‘‘from above’, whereas in the case of continuity all sequences converging to a) are admitted. 


Discontinuities. The function f(x) shown in Fig. 2.10 is both left- and right- 
continuous at the point x», but it is not continuous at that point. This is called a 
jump discontinuity. Apart from jumps there are also other types of discontinui- 
ties, e.g., poles (Fig. 2.11) or oscillations of f(2) which increase in frequency as x 
approaches certain points xq (Fig. 2.12). 


f(x) 


Fig. 2.11 


2.3.2. Properties of Continuous Functions 


Theorem |. Let f and g be functions with D(f) = D(g) in the sense of Def. 2.3.1/1. 
Let f and g be continuous at the point x9€ D(f). 
(a) [f A and «are real numbers, then Af(x) + ug(x) is continuous at xo. 


ney | 
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(b) f(x) + g(x) as continuous at xo. 

es : 
(ec) If, moreover, g(x) +0, then om is continuous at Xo. 
qr 
Theorem 2. Let f be a continuous function in the sense of Def. 2.3.1/3 (b). Further 
let D(f) be closed and bounded. Then 

im f(a) ae sup f(r)= max f(z). 
x€éD(f) rE D(f) re D(f) xe D(f) 

Remark 1. If D(/) is regarded as a subset of A, then, by Def. 1.3.3/2(b), D(/) is closed if it 
contains all of its accumulation points in Ay. Examples of closed sets in R, are a closed inter- 
val or the union of finitely many closed intervals. 


Remark2. Theorem 2 makes two assertions, namely (1) that f(x) is bounded and (2) that it 
assumes its infimum and supremum (whose existence, by Lemma 2.3.t/1, follows from the 
boundedness of f). 


Theorem 3 (location principle). Let f be continuous (as per Def. 2.3.1/3 (b)), and let 
D(f) be an interval. If « is a real number such that 


inf f(7)<a<sup f(x), 
then there exists at least one point x9€D(f) such that f(xo) =« (Fig. 2.13). 


Remark 3. In the last theorem we did not assume that D(f) be closed and bounded. f may be 
unbounded for open or half-open intervals D(f). Then in the theorem one has to set inf f(x) = 
= — ~ (if fis unbounded below) or sup f(x) = = (if f is unbounded above) (Fig. 2.14). 


Theorem 4. Let f be continuous (as per Def. 2.3.1/3 (b)), and let D(f) be closed and 

bounded. Then f is uniformly continuous (Def. 2.3.1/3 (c)). 

Remark 4. The sawtooth-like function shown in Fig. 2.15 is continuous on D(/) =(0, 1], but 
1 

oat al 

where n=1, 2, 3, ... This shows that the assumption made in the last theorem, namely that 

D(f) be closed, is essential. 


not uniformly continuous; the curve is made up of triangles of height 1 and base 


Fig. 2.14 


2.4. Continuous Mappings in Metric Spaces 


Oriele Definition 


With o(z, y)=|x—y\, Ry becomes a metric space. If fis a function in the sense 
of Def. 2.3.1/1, with D(f)=R,, then the continuity of f at the point ato € Df) 
(Def. 2.3.1/3 (a)) may also be described as follows: for any «+0 there is some 
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5=0 such that 0(/(x9), f(x))<e for all x€ Ry with o(xy, z)=6. This formulation 
can however be generalizcd immediately. 


Definition. Let M, and My be two metric spaces with the metrics 0, and 05, respec- 
tively. Further let fbe a mapping of M, into Mo (2.e., every x€ M, has some f(x) € My 
uniquely assigned to rt) (Fig. 2.16). 


Fig. 2.16 


(a) f is called continuous at the point x9€ M, if for all e>O0 there exists a positive 
number 6=6(e, xq) such that oo(f(xo), f(c)) Se for all ee M, with o4(xo, x) =6. 

(b) f ts called continuous on M , tf f ts continuous at every point x9 € My. 

(ce) f ts called uniformly continuous on M , cf f iscontinuous on M, and rf, for all 
e>O, the above number 6 can be so chosen that 6=6(e) depends only on e, but not on 
Xo € eae 
Remark 1. This is analogous to Def. 2.3.1/3; see also Remark 2.3.1/4. 


Remark 2. A subset V of a metric space J/ is itself a metric space (with the same metric). 
If the above mapping f is defined only on a subset D(f) (domain of definition) instead of the 
whole set J/,, then the above definition can be applied by substituting D(f) for A/,. In this 
sense the above definition is in fact a generalization of Def. 2.38.1/8. 


2.4.2. Examples 


As was pointed out at the beginning of Subsec. 2.4.1. (see also Remark 
2.4.1/2), Def. 2.4.1 is a generalization of Def. 2.3.1/3. It is of interest to consider 
other examples, too. 


Complex-valued functions. Let 1, be a subset of Ry, and M,=C,. Then f(z) isa 
complex-valued function and can be decomposed as follows 


{(z) =Re f(x) +i Im f(x), xeM,. (1) 


ioe eles 


Lemma 1. The complex-valued function f(x) 7s continuous tf and only tf the real. 
functions Re f(x) and Im f(x) are continuous. 


Real functions in R,. Let M, be a subset of R, and M,= k,. The mapping f can 
be thought of as a surface represented versus Ry; n=2 has an immediate geo- 
metric meaning (Fig. 2.18). It is clear what is meant by continuity in the sense 
of Def. 2.4.1. 
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F(x) 
0 (fx), fC) 


Fig. 2.18 


Complex-valued functions in R,. Let M, be a subset of R, and M,=C;. The 
decomposition (1) as well as Lemma 1 are applicable in this case, too. 


Mappings in R,. Let M,=My= Rp, and let f(z)=(f,(zx), ..., fa(z)), where f;(x) are 
real-valued functions in R,. f(x) is called a ee. tee 


Lemna 2. The vector function f(x) =(f,(x), .-, fa(x)) ts continuous tf and only af 
the real- lated! functions A(x y= a es ee are mappings of Ry tnto Ry, are 
continuous (componentwise continuity). ; 


Linear mappings. Let a;.x, se (eee onde le. 7, be Teal itmubercs 
then the vector function f(x) =(f;,(x), ..-, he with 


= sl oe — es 
K=1 


is continuous. 


DA Be Real-Valued Continuous Funetions in R,, 


The considerations made in Subsecs. 2.3.1. and 2.3.2. can be transferred from 
R,to Ry. Let f be a function mapping D(f) c Ry into Ry. Definition 2.3.1/4 can be 
adopted word for word. 


Lemma. f 7s continuous at the point xy€D(f) if and only if f(x) f(xo) a8 &> Xo 
(analogue to Lemma 2.3.1/2). 


Theorem 1. Let f and g be two real-valued continuous functions in Fy, wih Dif)\= 
= D(q), and let both functions be continuous at the point xy € D(f). 

(a) Let A and pw be real numbers. Then Af(x)+ug(«) as well as f(x) + g(x) are 
continuous at the point xy. 

(b) Lf g(xo) +9, then a 7s continuous at the point xq (analogue to Theorem 2.3. it) 

glx 

Theorem 2. Let M be closed and bounded in Ry. Further let f be continuous on M = 
= D(f) (Def. 2.4.1 (b), where M=M,). Then 

(a) inf f(z) = min f(x) and sup f(a) =max f(x), ond 


zen ae M xen ce aM : 
(b) f zs CDOS Pe ae (Def. 2.4.1 (c)) (analogue to Theorem 2.3. 2/2 and 


Theorem 2.3.2/4). 
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2.5.1. Definitions 


Definition 1. Let M be a metric space with the metric o. 

(a) A sequence {x;}7.1c M is said to converge if there exists an element x96 M 
with the following property: for any positive number é there is a natural number jo(e) 
such that o(xo, x;) =e for all 72 jo(e). 

(b) A sequence {x;}7.4< M ts called a Cauchy sequence (or fundamental sequence) 
if for any positive number e there exists a natural number jo(e) such that o(x;, xp) Se 
for all j and k with 7=jole) and k= jo(e). 

Remark 1. This is a generalization of Def. 2.1.1/2(b) and Def. 2.1.2, which apply to M =O. 


Lemma 1. Every convergent sequence (in the sense of the above definition) is a 

Cauchy sequence. 

Remark 2. By Theorem 2.1.2./2, the converse is also true for Jf =C,. For general metric 

spaces, however, this is not correct. Consider, for example, Jf =(0, 1] with the usual dis- 
1 

tance-type metric, then oie is a Cauchy sequence, but does not converge. 

Definition 2. A metric space is called complete if each of rts Cauchy sequences is a 

convergent Sequence. 


Remark 3. Remark 2 shows that the definition is meaningful. 
Lemma 2. &, and O, are complete metric spaces. 


Remark 4. The lemma follows from Theorems 1.3.1/2 and 1.3.2/2. The space M =(0, 1] men- 
tioned above is an example of a metric space which is not complete. 


2.5.2. The Space C[a, b] 
Definition. Let —.<a<b<~, then C[a,b] = {f | f real and continuous on[a, b]}, and 
olf, g)= sup | f(x) —g(z)). (1) 
r€[4,b] 


Remark 1. Hence C[a, 6] is the space consisting of all real-valued continuous functions on the 
closed interval [a, b]. The ‘‘points” of this space are functions. If f and g are continuous on 
[a, 5), then /—g is likewise continuous on [a, 5]. Hence, as [a, 5] is closed, f —g is bounded and 
the sup in Eq. (1) may be replaced by max (Theorem 2.3.2/2). 


Theorem. O[a, b] is a complete metric space. 


Remark 2. The metric in C[a, 6] is given by (1). Onc has to show that 9 satisfies the condi- 
tions of Def. 1.3.3/1. 


Remark 3. Instead of the closed interval [a, 6] one may also consider the open interval (a, b). 
Then C(a, 6) ={f| f real, continuous and bounded on (a, 6)}. Here it must be required in 
addition that / be bounded. With the metric (1), O(a, 6) is again a complete metric space. It 
is easy to see that there exist some /€ C(a, 6) which cannot be extended to the points a and 6 
in such a way that after the extension they belong to C{a, 6] (Fig. 2.19). On the other hand, 
the restriction of /€C[a, 6] to (a, 6) always belongs to O(a, b). 


Fig. 2.19 


QDadh- 2.5. Complete Metric Spaces 19 


2.5.3.  Banach’s Contraction Mapping Theorem 


Definition. Let M be a metric space with the metric 0. A mapping f of M into M is 
called contracting if there exists a number « with O=a<1 such that 

olf(x), ((y)) Sele, y) (1) 
for all x€M and all y€M (Fig. 2.20). 


Fig. 2.20 


Lemma. A contracting mapping is continuons. 
Remark. Hence one has to check that Def. 2.4.1 (b) is satisfied for M,=M,=M. 


Theorem (Banach’s contraction theorem). Let M be a complete metric space, and 
let f be a contracting mapping in M. Then there exists exactly one point x€ M such 
that f(x)=<x (fixed point). 

Example 1. Let M=R,, and let f(x) be a continuous function on Ay, with 
f(x) —f(Wlea lx—y! for all re R, and all yc R,. Here, 0Sa2~<1. If f(x) is inter- 
preted as a mapping of #, into R,, then the above theorem is applicable. xo is the 
only fixed point with /(z)= x (Fig. 2.21). 


Example 2. The plane Ry, or a subset M of R,, becomes a metric space if the 
usual distance between any two points is taken as a metric. If M is a circular 
ring and f is a rotation about the common centre of the two disks, then (1) 
holds with «=1 (Fig. 2.22). On the other hand, in general such rotations have 
no fixed points. This shows that «<1 is essential in the theorem. 


Fig. 2.22 
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3. Differential and Integral Caleulus in R, (Basic Concepts ) 


3.1. Differentiation 
Boll all. Definitions 


In this chapter we shall consider (with a few exceptions) only real-valued func- 
tions in the sense of Def. 2.3.1/1. If f is not real, this will be expressly mentioned. 


Definition. Let f be a real function, and let D(f) =(a, b) be an open interval. Let 
Bae Gy ON CHIG. 3.1) 


[ 


| 
Xply| Xothy 


(a) f(x) as called differentiable at the point x, af 


froth) —f(a) ae 


a as h-O, 


d 
where |h| =hy and h+0. One denotes x=f' (29) -t (Xo). 


(b) f(x) as called differentiable on D(f)=(a, b) af f(x) is differentiable at every 
point xa€ (a, b). 
(e) f(x) 28 called right-hand differentiable at the point xo if 


(ot N=10 5 os mo, 


where V<h<hg. One denotes B=f!,(xo). f(x) vs called left-hand differentiable at the 
point x uf 

f (xo +h) —f(xo) ses 

ue ? 

where hy<h<0. One denotes y=f'(x0). 


as Ip tO) it 


e 


Remark 1. h ¢0 and hy 0 were explained in Remark 2.3.1/5. When D(f) =[a, 5), one may also 
find the right-hand derivation at the point aa. By analogy, one may find the left-hand 
derivation at the point b when D(f) =(a, 6). 


Remark 2 (geometric interpretation). Let f(a) be differentiable at the point ap, then the 
straight line 

g(x) =f(Xo) + f(x) (% — 2X0) (1) 
is ealled tangent (to the eurve f(x) at the point a) (Fig. 3.2). It is easy to see that g(a) has 
the approximation property that 
{(x) — g(a) 


() AS PPS my 
aay | ae (2) 
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Lemma 1. (a) Let f(x) be differentiable at the potnt ao, then the line (1) is the only 
line in the plane which has the property (2). 


(b) f(x) us differentiable at the point xo tf and only if f(a) vs both right-hand and 
left-hand differentiable at xo and f',.(xq) = f_(xo). In this case, f’ (ao) = f',(%) =f (xp). 


Remark 3. The roof-like curve shown in Fig. 3.3 is both right-hand and left-hand differen- 
tiable at zp, but not differentiable at that point. 


Lemma 2. If f(x) ts differentiable at the point xo, tt 1s also continuous at Lo: 


Remark 4. Consequently. differentiability is more than continuity. 


gtx) 


F(x+h)-F (Xp) 


f(x) 


Ble. Rules 


Theorem 1. Let f and g be differentiable at the point xp. 

(a) Let 2 and uw be real numbers, then Af+ug ts differentiable. at the point xo, 
and (Af + ug)’ (xo) = Af (xo) + Hg (0)- 

(b) fg ts differentiable at the point xq, and 


(19) (0) =f (0) (xo) + f(x0) 9’ (&p) - 
(c) If g(x) #0, then f is differentiable at the point x, and 


f\’. , £0) g(x) — fx) go) 

1) Cn — eee ye eae 

g g°(xo) 
Theorem 2 (chain rule). If g vs differentiable at the point xp, and if f is differentiable 
at the potnt g(xo), then h(x) =f(g(x)) vs differentiable at the point xo, and 


h’ (x9) = f'(g(%o)) 9 (20) 


3.1.3. Examples (Rational Functions) 


Let x be a natural number, then 
eens | HO. Beas 
(eo ee for eh, 2-0. 
Hence, for any integer k, the following relation holds: 
(a = for eh, 2-0. 
Using this formula together with Theorem 3.1.2/1, it is possible to calculate the 
e 


2 aj 


6 : : é R=0 : 
derivation of any rational functions &(r)=—4;-— , where attention must 


SM GR op 
2, bya! 
. 720 
be paid to the zeros of the polynomial in the denominator. 
4 Triebel, Math. Physics 
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3.1.4. Inverse Funetions 


Definition. (a) The open interval (a, b) Dif) (Fig. 3.4) ts called monotony 
interval of the function f if f (ncreases monotonically in (a, b) (i.e., f(a) = f(x) for all 
ry and x, with a<xq=x,<b) or decreases monotonically in (a,b) (ze., f(%)= 
= f(a) for allagand x, with a<xy=ax,<b). If f(xy)=f(%9) (or f(xy) <f(%9)) forall xq 
and x, with a<xy<a,<b, then f is called strictly monotonic increasing (or strictly 
monotonic decreasing). 

(b) Lf y=f(x) ts strictly monotonic (increasing or decreasing) in (a, b), then the 
mapping y>x is called inverse function, denoted by f—' (Fig. 3.5). 


F(x) 


Fig. 3.4 


Remark 1. Let / be strictly monotonic increasing in (a, 6). Then f is a one-to-one mapping 
of the points x9 €(a, 6) onto some set of points on the y-axis. f is determined by the mapping 
f:x%—>Yo. The inverse function is determined by the mapping f~!: y, 2p. The roles of the 
x- and the y-axis are interchanged. Plotting /~'(a) again as a function of « can simply be 
done by reflection of the curve with respect to the line y=. 


Theorem. Let f(x) be differentiable on the interval (xy—46, x +6), where d=>0. Let 
f(x) be continuous at the point x, f(x) +9 and Yo=f(xo). If ¢ ts sufficiently small, 
then f vs strictly monotonic in the interval (xg—e, xy+e). The inverse function f-1 


ts differentiable at the point yo, and (f—1)’(y) =e 
: Xo 

Remark 2. It is easy to show that the last assertion follows from the above reflection prin- 
ciple. if one sets «=a, and #=Yyo, taking into account that the derivative describes the slope 
of the tangent line. 


3.1.5. Mean Value Theorems 


The function f is called continuously differentiable on (a, b) if f is differentiable 
on the open interval (a,b) and the derivative f(x) is continuous on (a, b). 


Theorem 1 (Rolle’s theorem). Let f be continuous on [a,b] and continuously dif- 
ferentiable ou (a,b). If fla)=f(b)=9, then there exists at least one point xy €(a, b) 
such that f(g) =0 (Fig. 3.6). 


Remark 1. xj may be chosen as a point where f(x))= sup f(x)= max f(z). 
xé[a,b] xé[a,b] 


Fig. 3.6 
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Theorem 2. Let f and g be continuous on [a, b] and continuously differentiable on 
(a, b). Further let g’(x)+0 on (a, b). Then there exists at least one point xo € (a, b) 
such that 


f(b) =a) _ fe) 
glb)—gla)— g’(Xo) : 


Remark 2. g(x) is strictly monotonic in [«, 6]; m particular g(b) + g(a). 


(1) 


Remark 3. Let g(x) =a. Then it follows from (1) that there exists an x €(a, 0) such that 
f(b) — f(a) 
= F(x . Geometrically, this means that the tangent at 2» is parallel to the line 


passing ces the points (a, f(a)) and (6, f(6)) (Fig. 3.7). 


Theorem 3 (’Hospital’s rule). Let f and g be continuous on [a, 6] and continuously 
differentiable on (a, b). Further let f(a)= g(a) =9, g(x) +9 for x€(a, 6), and g(x) +0 


for x€(a, b). Hf im ie) exists, then lim i) exists, too, and 
z}a g’(x) zie g(2) 


1 


Remark 4. If f(x) ~ and g(x) > © asa Ja, then the theorem can be applied to = (= ~) é 
H(z) 


Remark 5. It is possible to formulate analogous theorems for x t@ or for xa. 


Examples. Let x be a natural number, then 


eee ic 
ling ——— = lim == . 
za>ir- xt 
Nn 3 3 — I\e n—1 ® — Ul) [onc 
lim Se gue See =lim G dil! == ((). 
z-1 (y—1)2 a1 2 (x-1) a+) 2 


The second example shows that the theorem can also be applied iteratively. 


3.1.6. Higher-Order Derivatives; Derivatives of Complex- Valued Funetions 


Higher-order derivatives. It is clear that Def. 3.1.1 can be used iteratively. If 
f(x) is differentiable on the interval (a, b), one may ask whether /’(z) is differen- 
tiable at the point r9€(a, b). If so, we write f(x) =(f')’ (x9). Iteration: {OGG = 
= ({@~ 0D)" (Xo). 

Complex-valued functions. If we consider (by way of exception) complex-valued 
functions f(x)€C, with D(f)=(@, 6), then Def. 3.1.1 (a) can be adopted word for 
word, provided that 


f (xo +h) — flo) 


- f(xy) as h-0 
h 


is now understood as convergence in Cy. 


4* 
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Lemma. The complex-valued function f(x) ts differentiable at the point x9 if and 
only if the real-valued functions Re f and Im f are differentiable at the point xp. 


If this is the case, then 
f (x0) = (Re f)'(xa) +i(Im f)’(xo) « 


3.2. Integration of Real-Valued Functions 


3.2.1. Definition of Riemann’s Integral 


Let —o<a<b<oo. Consider a subdivision Z of the interval [a, 6] (Fig. 3.8): 


C—6)— (on 0 Oe (1) 


Fig. 3.8 


Definition. Let f(x) be a real-valued bounded function on D(f)=[a, 6). 


(a) Then one denotes 
: n 


f(s —sup Doemewnerc 6 a, a) ee ee 
J - ; 


=1 YE la;.05 4.4) 
- n 


(hee dose De, where >3*= > @ei—e) sup (G7). 
Z 


j=0 yE[a;,a; 44] 
(b) f(x) ws called integrable if f f(x) dxr=f f(x) dx. 
Remark 1. sup is the supremum taken over all subdivisions Z of the form (1), where 7 is 
Z 


variable (supremum taken over all” and all points of subdivision) (inf is defined analogously). 
Z 


Since /(@) is bounded, inf f(y) and sup f(y) exist for the corresponding subintervals. YF is 
called lower sum, corresponding to the sum of the shaded rectangles. f is called lower inte- 


gral. Accordingly, 5z” is called upper sum and f is called upper integral. 
Remark 2. If f(x) is integrable, we write 
= b 
; J Ge) dee je) Ghe= if @) de= ff en) de . 
iW a 
where the last notation indicates the integration interval. This is Riemann’s integral. 


Lemma 1. f f(x) dx =f f(x) dae 


Remark 3. A function is thus called integrable if and only if the sign of equality holds in 
Lemma 1. 

Lemma 2. Let f(.c) be integrable over the interval [a, b|. Further let yj be any point of 
[a;, aj41] according to subdivision (1). Let 


n 


Liz Li (j+1—4) flys) - 


7=0 
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Then there extsts a sequence Z. of subdivisions such that 
4) Alok x yp ee F 
f f(x) dz =lim Li = lim a =lim aR 


Remark 4, One has 22,= 22, = Dz. , the integral f {(«) dx being approximated from below 
by & Pep and from above by 2a 


Lemma 3. Let f(x) be integrable over [a, b], and let c€ (a, b). Let Z denote all sub- 
divistons (1) which contain cas a point of subdivision. Then 


(a) fz) dxr=sup 2*,=inf D**. 
ze Ze 
(b) Lemma 2 remains true when the set of possible subdivisions ts restricted to 
those of Z°. 
3.2.2. Properties 


Theorem 1. Let f and g be integrable over (a, b}. 
(a) For any real numbers A and p, Af+yg ts integrable over [a, b], and 
f(Aflx) +ug(e)) dr=a f fla) de +p f g(a) de. 
(b) If, in addition, f(x) = g(x), then 
f f(x) dx=fg(x) dr. 
Theorem 2. Let f be cntegrable over [a, b}. 
(a) For any c€ (a, b), f ts tntegrable over [a, c] and over [c, b], and 
b c b 
Sfx) dx=f f(x) dx+f f(x) dx 
a a € 


(b) |f(x)| as Bis over [a, b], and 


f fe ) dx 


Remark. If f is integrable over [a, 6] and if a<c<d <b, then { is also integrable over [ce, d]. 
This follows from (a). 


=f Ife) | dx. 


3.2.3. Commutation of Passage to the Limit and Integration 


Definition. Let —o<a<b<o. A sequence {f;(x)}j_1 of real- (or complex-)valued 
functions with D(f;)=(a, b) is said to converge uniformly towards f(x) with D(f)= 
= (a, b) af for each e=+0 there exisis a natural number jo(e) such that 

sup | f(x)—f(x)l Se 


x €(a,b) 


for all j with 7 =jo(e) (Fig. 3.9). 
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Remark 1. Uniform convergence of /; towards f is written as /;=>/. 

Remark 2. [f the functions /; are continuous on [@, 5], then uniform convergence coincides 
with convergence in the space C[a, 6] defined in Subscc. 2.5.2. Since Cla, 6] is complete 
(Theorem 2.5.2), it follows that fEC[a, b]. 

Theorem. Let the functions fj(x), where j= 1, 2, 3, ..., be integrable over [a, b]. If 
ff on (a, b), then f(x) ts also integrable over (a, bj, and 


(7) dali fad (1) 
f} oo 
Remark 3. (1) can also be written as 
f (lim f,(x)) da = lim f f,(x) da 
jr jee 


(this means that passage to the limit and integration are commutable). 


~ 


3.2.4. Examples of Integrable Functions, and Count er-Examples 


Let —o<a<b<o. A real-valued function f(x) with D(f)=[a, b] is called 
piecewise continuous if f(x) is continuous on the right at every point xr9¢{a, b) 
and continuous on the left at every point 29 € (a, b] (Def. 2.3.1/5) and if the num- 
ber of points of discontinuity is finite (Fig. 3.10). The discontinuities are there- 
fore jump discontinuities as described in Subsec. 2.3.1. Functions of this kind 
are bounded. Continuous functions are special cases of piecewise continuous 
functions. 


Theorem. Every function which is piecewise continuous on [a, b] is integrable over 
{a, b]. 
Remark. In particular, /(x) =c is integrable, and 


f f(a) dw=c (b—a). 

1 for x rational : : 
-—1 ae Be pn Sp 
not integrable (in the sense of Riemann). For it is immediately clear that all 
the lower sums are >)% = —1 and all the upper sums are >)3* =1. On the other 
hand, |f(~)|=1, and hence is integrable over {0, 1]. This shows that the converse 
of Theorem 3.2.2/2 (b) is not true. 


Counter-example. The function f(x) = 


3.2.0. Primitives 


Definition. Let —o<a<b<oo. A real-valued function f(x) with D(f)=[a, b] ts 
called Lipschitz-continuous if there exists a positive number M such that 
| fla) —f(%9)| = M|x,—29) for all x, and all xy in [a, 6]. 


Belo 3.2. Integration of Real-Valued Functions 27 


Lemma I. (a) Every function which is Lipschitz-continuous on fa, b] is also uni- 
formly continuous on fa, b] (Def. 2.3.1/3 (c)). 

(b) Tf f ts continuous on [a,b] and differentiable on (a, b), with sup |f’(x)| <<, 
then f is Lipschitz-continuous on (a, b]. ACD) 


Remark. A roof-like function is Lipschitz-continuous on [a, 6], but not differentiable at the 
point 24 (Fig. 3.11). The inverse function / of y=2? is continuous on [0, 1], but not Lipschitz- 
continuous (Fig. 3.12). 


Vig. 3.12 


Theorem 1. (a) Tf f ts ¢ntegrable over [u, b], then 
2h 
F(x)= f fly) dy ts Lipschitz-continuous on [a, b]. 
a 


(F(x) is called the primitive, or Newton’s indefinite integral, of f(x)). 

(b) If f(x) ts continuous on fa, b], then F(x) ts differentiable on (a, b), and F’(xr) = 
= f(e). 
Lemma 2. /f f’(x)=0 on (a, 6), then f(x) ts constant on (a, b). 


Theorem 2. Let f and g be continuous on {a, b); let g be differentiable on (a, b) and 
g (x)=f(x) for xe (a, b). Then 


b 

Sf fly) dy=gb)—g@) - 
ght) 
Example. For any natural number n, f(x) =.” and g(x) = i 
ties required in the theorem. It follows that wes: 


: bt +1 
a 
eles ae 
f wil ie Il 


have the proper- 


m+ 


a 


3.2.6. Integral Operators 


€ 


Let —o<a<l<o, then to every function f€C[«, b] (Def. 2.5.2) there is 
assigned a function Hf by 


(A(x)=fhly, fy) dyte, x€fa, b). (1) 


Here c denotes a constant and A(y,z) is a real-valued function with D(h)= 
=[a, b])x Ry={(y, 2) |asysb, z€ hy}. Naturally, h(y, z) must be subjected to 
additional conditions in order that (1) be meaningful. What is desired are condi- 
tions which make /H[ a continuous mapping of C[a, b] into Cla, b] in the sense of 
Def. 2.4.1(b). [In the notation used there, M,= M,=C{a, 6], and e,=e)=¢@ has 
the meaning defined in Formula (2.5.2/1), while the mapping denoted f in Def. 
2.4.1 is now denoted /.] 


28 3. Differential and Integral Calculus in Ry i 3.2.6. 


Theorem 1. Let h(y, z) be continuous on D(h)=[a, b] x Ry, and let h satisfy the 
Lipschitz condition 


jh(y, 2) —h(y, 2*)|=M le—2*| 


for all y€[a, b], z€ Ry, and z*€R,. Here M ts a positive number ae independent 
of y, 2, and z*. Then H is a continuous mapping of C[a, b] into C[a, b], and 


o(Hf, Hg)=M (b—a) olf, g) forall feCla,b] and géC[a, bd]. 
Moreover, (Hf)(x) is differentiable on (a, b), and 

(Hf) (x) =Ma, x) for x€ (a,b). (2) 
Remark 1. For /€C[a, 6], h(x, f(x)) is continuous on D(h). 
Definition. Let —.<a<b<o, then one denotes 

Cla, 8] x. Cla, B= (Ff Hhaleds os Male}, He) €Ola, BY 


n ni 


Ama Melieseelle 


j=1 xela,d] 
CT eng == (amen On) 
Lemma. C[a, b] x ...x Cla, b] with the metric on is a complete metric space. 


Let us consider the vector-valued analogue to (1), namely 
(Asf)(2) = Aythy, > fn}(x) 
=fhilys Hy), fly) Aybe, rela, d], (3) 
where the c; are again constants, 7=1, ...,n. We set 
Af={H yf, ... Hnf}. 
Here the hj(y, 24, ..., 2n) are real-valued functions with the domain of definition 
Dh) = |G OX Te — (24, ee 2) = =e ee re ae 


Theorem 2. Let hj(y, 2, ..-, 2) be continuous on D(h;)=[a, b] xX Ry and satisfy the 
Lipschitz condition 


JgQs 24s 5 2n) Tl, 20, «, 2 )| = (lz;—27¢ | 4-12, —25)) 


for all y€[a, b], (21, «5 2n)ERn, and (27, ..., 2%) € Ry. Here M is a positive number 
being independent of Y, 24; +++) %n» Z's + 2. Then H is a continuous mapping of 
Cla, b] Xx... Cla, b] into itself, and 


onl}, Hg)=nM(b—a) onlf, 9) (4) 
for all f and g in C[a, b) x... x Cla, b]. Moreover, (H;f)(x) is differentiable on (a, b), 


and 
(Hf) (x J=hila, Pap erer Jnl) ie 


Theorem 3. 1f nM(b—a)< then the mapping H of Theorem 2 has eractly one 
fixed point in Cla, b]x...x Cla, b]. 


Remark 2. Theorem 3 follows from (4) and Banach’s contraction theorem 2.5.3. Conse- 
quently there exists exactly one n-tuple {f,(2), ..., fn(x)} with f,(~) €C{a, b], such that 


1) Gy PAGS, 1) eo, Fale) ie) — et 
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Then it follows from Theorem 2 that /;() is differentiable on (a, b) and that 
eel ine), ee (% Oe 7—1,..., 0. 


Equations of this kind, in which functions /,(2). ..., /,(v) and derivatives of these functions 
are connected with one another, are called ordinary differential equations (or, more pre- 
cisely, systems of ordinary differential equations). The investigations presented in this sub- 
section may also be interpreted in such a way that we have shown the existence of solutions 
of differential equations. provided that the functions h; satisfy certain conditions. 


4, Ordinary Differential Equations (Existence and Uniqueness 
Theorems ) 
4.1. Initial-Value Problems 


4.1.1. The Differential Equation f@(x)=0 


In this chapter we will consider only real functions. Let » be a natural num- 
ber, then 


fey= > ape x€(a, b), -w~<a<b<~, Gl 


j=O0 


is a polynomial of degree n —1, with the real coefficients a;. One has f(xr)=0. 
If n=1, then Lemma 3.2.5/1 shows that the converse is also true: from /’(x)=0 
it follows that f(x) =a, in (a, b). The following lemma generalizes this assertion. 


Lemma 1. [ff 7s n times differentiable on (a, b) and f™(x)=0, then f has the form (1). 
Remark 1. Therefore, polynomials of degree n—1 can also be characterized by f(x) =0. 


Lemma 2. Let x9¢ ky and c5€ Ry, where 7=0, ..., n—1. Then there exists a unique 
solution of f™(xr)=0 in Ry, where fx9)=c; for 7=0,...,.n-1 (fM=Pf). 
Remark 2. This solution can be written down immediately: 

n—t Cy ; : 

jai J! 


Problem. The question arises whether the above assertions on the special differential 
equation f™(x)=0 can be extended to more general differential equations of the form 


f(x) = hla, fo), f(a) v9 FO“ (2) - (2) 


Does Eq. (2) have a unique solution with the initial values f(xy) =¢; for 7=9, «.., 
n—1? 
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Se 
4.1.2. Formulation of the Problems 

In Subsec. 3.2.6., [a, 6] Ry was introduced for —o<a<b<o (Fig. 4.1). A 


function was said to be continuously differentiable if it is differentiable and its 
derivative is continuous. 


Problem 1 (Ordinary differential equations of order n). Let h(a, 14, .-.,%p) be 
continuous on|[ a,b] Xx Ry and satisfy the Lipschitz condition 


[Altos L245 «+1 tn) — h(2o, 2}, -., T2) |S UM (le, —aF| +... 4+ len — a) ) (1) 


for all xo€[a, 5), (a4, »5 Un) € Rn and (xj, ..., cF)€ Ryn. Here M is a positive number 
DPLEDENU CHT OC} Moy Rinuuay ins Ui ym, Oy ee UE Ler el Gree ye) eee wl 0d ao 
function f which is n times continuously differentiable on (a, b) and satisfies the 
conditions 

f(x) €Cfa, b] and fOa)=c; for j=0,...,n—1 (2) 
and 

fOa)=RMox, f(x), f(x), 5 f"-M(@)) for xE€{a, ). (3) 
Remark 1, The space C{a, z was introduced in Snbsec. 2.5.2. For the time being, f(/)(a) 


can be obtained only for x€(a, 6). f(x) € Ola, 6] means that fe (z) has a limit on te ak 
ataand a limit on the left at 6 and that one sets f(a) =lim f(a), fO() =lim f(a) 
xz+a x 


(Fig. 4.2). Problems of the type (2), (3) are called initial-value problems (Cauchy problems) 
for ordinary differential equations of order 2. 


Remark 2. If 7 =1, the right-hand side of Eq. (3) is reduced to f’ =h(z, f(x)). This case allows 
a simple geometric interpretation. At each point (xo, yg) €(a, 6) x Ry, plot a small segment of 
the straight line defined by y—yg=A(2, Yo) - (w—2o). This gives a so-called directional 
field (Fig. 4.3). The problem is to find functions fitting this directional field in such a way 
that the above straight lines become tangents to the curve f(z). 


A 
| /| 
| va 
| i) | | a ! 
tee Oa | 
ES a root 
an er 
| 1 L ak ay I 
) ! ~“ | a 
aS | eee Fig, 4.2 : ) Itiee, 3 
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Problem 2 (Systems of ordinary differential equations of order 1). Let hj(aq, 245 «+5 
Xn), where J=1, ..., n, be continuous on [a, b]X Ry and satisfy the Lipschitz condt- 
tion 

Wepl@igg Wis 2+) By) — Aj (@o; Li, ey He) = le ae | oe eae) 


£9 ; ; 
Lt. 4.2. Existence and Uniqueness Theorems 31 


for all xg€[a, 6], (ry, +> Tn) € Ry and (a, ..., rZ)E Ry. Here M is a positive number 
Ue Permlamo) oot, tye, +9, ANE) =I, ..., 0. Mupther letes€ fy, 7 =1, 2, 2: 
Pind a set of functions f(x), «5 fu(x) which are continuously differentiable on (a, b) 
and satisfy the conditions 

Piece Che, >| ward fi@j=ec; for 7=1,....™% 
and 


f=) e a fut) form j=l,.4,2 and xe(a, b). 


a2. Existence and Uniqueness Theorems 


4.2.1. Systems of First Order 


Problem 2 of Subsec. 4.1.2. is solved by reducing it to the problem considered 
in Subsec. 3.2.6. 


Theorem. Problem 2 of Subsec. 4.1.2. has a unique solution. 


4.2.2. Differential Equations of Order 


Theorem. Problem 1 of Subsec. 4.1.2. has a unique solution. 
Remark. Problem 1 is reduced to Problem2 by setting f=/;, f/=fy. .... f@—-D=f,. Then the 


differential equation (4.1.2/3) is equivalent to the system of order 1, 
fi=fe =f frt=tn» 
Ss fy(2), sees i) : 


with the initial conditions f/;(a)=c;_; for j=1, ....v. This means that ordinary differential 
equations of order » ean be regarded as special systems of order 1. 


4.2.3. Local Existence and Uniqueness Theorems 


For the sake of simplicity we shall restrict ourselves to f’ =h(x, f). Till now we 
have required that h be continuous on D(h)=[a, 6] x By Now we only require 
that h(x, y) be continuous on an open domain 2 of the x,y-plane. The directional 
field referred to in Remark 4.1.2/2 is then meaningful only on 2. 


Problem. Let h(x, y) be continuous on D(h) = 2 and satisfy the Lipschitz condition 

[A(x, y) —h(x, y*)l SM ly—y*l 
for all points {r, y}€Q and {x, y*}€Q. Here M ts a positive number independent of 
x, yand y*. Further let 

sup A(x, yl=N. 

{z,y}E2 
For fa, c}€Q, find a solution of f(x) =h(x, f(x), with f(a) =e, in a neighbourhood 
on the right of a, (a, 6). 
Theorem. Let (b—a) M<1, and let Q be the (closcd) rectangle plotted in Pig. 4.4, 
with QQ, then the problem has a unique solution in Q. 
Remark 1. This is a local existence and uniqueness theorem. ‘The method can however be 
appliediteratively (sequence of rectangles of the form Q), and in this way the integral curve 
f(z) ean be maximally continued (Pig. 4.5). 
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‘Remark 2. A corresponding theorem holds for differential equations of order » and for 
systems of order 1. 


d. Elementary Functions and Power Series 


5.1. Exponential Functions and Power Functions (Real- Valued) 
5.1.1. The Function e* 


Definition. e(@) ts the solution, being unique on Ry, of the differential equation 
f(x) =f(x) with the initial condition f(0) = 1 (Fig. 5.1). 


Remark 1. In the sense of Subsec. 4.1.2., h(a, y)=y. Hence, according to Theorem 4.2.2, 
f(x) =f(x) with f(0)=1 has a unique solution in the interval [—N, N]. If we let N +o. it is 
observed that this proposition also remains true for R, instead of [—N, N)]. 
Theorem. (a) e(x) 2s positive, strictly monotone increasing and convex on R. 
Further, 
e(r)>co as 4>oe, efx) +0 as x>r—-. 
(b) For all xe Ry, and ye Ry, 


e(x+y) =e(x) e(y) . (1) 
Remark 2. A real-valued function is called convex if the line segment connecting any two 
points of the curve lies above the curve. A real-valued function is called concave if the line 
segment connecting any two points of the curve lies below the curve (Fig. 5.2). 


convex concave 


Remark 3. The elegant proof of the theorem uses nothing but the existence and uniqueness 
theorems stated in Chap. 4 for differential equations: what a pity that proofs are omitted 
in this book! 
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Remark 4 (notation). We set e(1)=e. Then from (1) it follows that 
e(n) =e(1) e(n —1) =... =(e(1)}"=e" and e(—n)=e-”, 


where » is a natural number. This justifies the notation e(«) =e”, beeause e* has the previous 
meaning for integral values of x. 


5.1.2. The Function log x 


Definition. log x ts the tnverse function of e*. 


Remark 1. According to Subsee. 3.1.4. and Theorem 5.1.1, the definition is meaningful. The 
curve of log x is obtained by refleetion of the eurve of e* with respeet to the line y=z 
(Le. eB). 


Theorem. (a) The domain of definition of log x 1s D(log x) = (0, <). The function 
log x has derivatives up to arbitrary order; it is strictly monotone increasing and 


concave. Further, (log ae log 1=0, logr+e as r+, and logr> —o as 
0: ae 
(b) For all xc>0O and y=0, 
log (cy) =log x +log y. 
Remark 2. In particular, log 2* =k log x. where k is an integer. 


ate 


. Ie 
Lemma. lim 
Zl 7— 


Dell este The Number ¢ 


tL co 


1 n 
Theorem. e= lim (1 +4) f 


Remark. e=e(1) has the meaning stated in Subsec. 5.1.1. From Lemma 2.1.3/3 it follows 
that e=# and 2<e<4. One finds that e=2.718... 


5.1.4. The Functions a” and logy x 


Definition. (a) For a>0 and x€ Ry, one sets a® =e* 84, 
(b) Let a>0 and a+1, then logyc ts the inverse function of a”. 


Remark 1. For a=1 one has 17 =a7=e9=1. For a>0 and a+1, a* is strietly monotone. 
Hence there exists an inverse funetion, whieh ean be obtained by the reflection prineiple 
(Fig. 5.4). For a=e one has log, e=log x. From now on we will write log x=In x. 
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Remark 2. One has a!=eln@=a. Further, a* has the usual meaning if k is an integer. This 
justifies the notation. 
Theorem. (a) a+" =a?a", a%b® = (ab)®, (a*)¥ =a for a>0, b>0, ae Ry and ye Ry. 
(b) (a*/ =a* Ina for a>0 and re hy. 
(e) logy (ry) =loga x+log, y for r>0, y>0, a>0, a+1. 
(d) logar=—" logy x for «+0, a>0, b>0, a+1, b+1. 


Remark 3. The curves a”, where a >0 and #¢€ R,, cover the upper half-plane without gaps, 
except for the set {(x, y) | s=0, 0<y<o, y+1}. If for instance x>0, then a* > as a> 
and a? +0 as a| 0 (Fig. 5.5). 

a* a>] 


5.1.5. The Function x*% 


According to the above considerations, x” is defined for x>0 and yé Ry. So 
far we considered x=a fixed and y variable. Let us now fix y=a€ R, and in- 
vestigate the function x%, with the domain of definition D(x) = (0, ) (Fig. 5.6). 


Mheorem. (a) x2 Oy — (ay)? a or nena or era, 
BER. 

(b) Lf a>0, then x* is strictly monotone increasing, x% > co as x-> 00, and x* | Oas 
x} O.Lf «<0, then x% is strictly monotone decreasing, x* | Oas x>0, and x* > as 
x} 0. Further, e®=1. 

(cy Forz=Oanl een, (7) =o =. 


Remark J. (a) is a mere transcription of Theorem 5.1.4(a). 


P : : . 
Remark 2. If «=—=0 isa rational number (p and qg being natural numbers), we may also 
Cea ae aes 
write #%= Vx?, and further, «2 =Vx eS Finally, one sets 0%*=0 for «=>0. 
a 
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5.2.1. The Funetions sin 2 and eos x 


Definition. (a) sin x ts the solutton, being unique on Ne i the differential equation 
f' (x)= —flx) with the initial conditions f(0)=0, f’(0) 

b) cos x is the solution, being unique on Ry, of - diferent equation f’ (xc) = 
= —f(x) with the initial conditions f(0) =1, f’(0)= 


Remark J. In the sense of Subsec. 4.1.2.. h(x, y, z)=y. Hence, according to Theorem 4.2.2, 


f{"(x) = —f(x) with /(0) =0 and f’(0) =1 has a unique solution on the interval [—N, N]. If we 
let V —-, it is observed that this assertion remains valid for R, instead of [—N, N]. 


Remark 2 (geometric interpretation). Later on we will calculate the length of smooth curves 
L. Tf, for instance, Lis the unit circle (circle of radius 1), then one considers the length of 
inscribed polygons (Fig. 5.7). The supremum of these lengths is finite, being denoted by 
2x. One obtains 7=3.141 ... The method is also applicable when the length of the are PoP, 
shall be determined, which is denoted by x (Fig. 5.8). Consequently, 0=sx=2r. Further, 
/(x) and g(x) have the meaning indicated in Fig. 5.8, where the signs shown in Fig. 5.9 must 
be taken into account. 


A Be g(x) 
NY x . 


<a Fig. 5.7 Fig. 5.8 


above: a 
e\\ 


SS 
SS 
ou 


belcw: sign 
of F(X} 


Fig. 5.9 

Lemna. For 027 = 2n, g(x) =sin ¢ and f(x) =cos zx. 

Remark 8. To prove this, it is shown that f(x) and g(a) satisfy the above differential equa- 

tions and initial conditions. 

Theorem. Let re Rh, and ye ky. Mh 
(a) sin x=sin (y+2n) (Fig. 5.10), cosr=cos(xr+2n) (Fig. 5.11) (periodic 

functions). 


COS X 

Fig. 5.10 NO Fig. 5.1 
Ol x x 
2 


(b) sin =O if and only tf x=hr (where k is an integer); cos c=0 if and only éf 


T™ : 2 
a=>tkn (where k 1s an integer). 


(e) sin x= —sin (—x) (odd function with respect to 0), cos x=cos (—.r) (even 
function with respect to 9). 
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d) sin?a+cos?z=1 for re Ry. 


( 
(e) sin x=cos (<-§) ; 
( 


dad 
ft) (sin)’=cosz, (cos x)’=—sin2. 
(g) sin (r+y)=sin x cos y+sin y cosa, cos (r+ ¥) = COS & COS y—sin x sin y. 


2 


Remark 4. The theorem is proved using the existence and uniqueness theorems of Chap. 4 
as well as the above lemma. 


Remark 5. In particular, {sin z{=1 and |cos a| =1. Further it is concluded that sin (w7+7)= 
= —sin x and cos (w%+7)= —cos &. 


5.2.2. The Funetions tan x and cot x 


eo 


See Siete : see ; T 
Definition. tan x= , with thedomain of definition D(tan x)= U ar kr, 


with the domain of definition D(cot x)= 


T COS x 
=+hk Big, 5.12); r= 
5 4 x] (Fig Ve Gn ee cae 


2 eR eee Ce eae 
k=—co 


Theorem, tan (r+x)=tanz, cot (x+7)=cot 2, 
(tana) = (eetanee (cong) —— cous 


where x€ R, is any point of the respective domain of definition. 


Doosan The Funetions aresin # and arctan x 


amere : P : ; ; Tt ce ee 
Definition. arcsin x 7s the inverse function of sin y, with —><y~=>Z (Hig. 5.14). 


“ 


: é Q : ™ ao : 
aretan x 7s the tnverse function of tan y, with -s<y<> (Fig. 5.15). 


Remark 1. sin y and tan y are strictly monotone on the interval ( - 


. The inverse 
functions can then be constructed by the reflection principle. 


arcsin X 


arcian x 


Theorem. D(arcsin x)=(—1, 1), and D(arctan x)= Ry. Further, 


Sy ae 1 ; 
(aresin x) sae and (arctan 2) See 


where x ts any element of the respective domain of definition. 


Remark 2. By analogy. areeos x and areeot x ean be introdueed as inverse funetions of 
cos y and eot y, respeetively, on monotony intervals. 


5.2.4. The Function ei? 


The are length along the unit circle is denoted by @ (ef. Subsec. 5.2.1.), 
O=¢=2n. Let z(y)=cos p+isin ¢ (Fig. 5.16), where this complex function with 
the period 2x is extended to the whole R,: 2(y)=z2 (y+ 2kn), k being an integer, 
pe ky. 

Lemma. z(o+y)=2(¢) 2(y) for p€E Ry, and pe Ry. 

Remark 1. The lemma justifies the notation 2(y) =e!”. One finds e? =e?! =1 and ei =e 7 — 
=-l. 

Remark 2 (representation of complex numbers in polar eoordinates). Let z=x+iy be a 
eomplex number, then the distanee to the origin is denoted by r=|z|. One has r?>=2?+y?, 
and hence, using the previously introduecd ealeulating rules, one finds r=|z|=)Va?+y?. The 


Z ? : 
eomplex number ms lies on the unit eirele (provided that r+0; Fig. 5.17) and ean thus be 


represented as e!?, i.e. z=re!” (representation of eomplex numbers in polar eoordinates). 
Here one may substitute p+2kz, with any integer 4, for gp. Some of the ealeulating rules 
for eomplex numbers are simplified by the ineeodaenon of polar eo-ordinates. If e—rel?, 
then Z—re7!®. If, additionally, w=se'¥, then 

z+ w=rsell?ty) , 
Ceometrieally, the multiplieation by # means a dilatation by s=|w|, combined with a rota- 
tion by the angle p (Fig. 5.18). 
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zw 


p rs 
y +p g & 
f Fig. 5.18 


Remark 3 (Moivre’s formulas). The representation given above can also be utilized to cal- 
culate identities for trigonometric formulas. Let x be a natural number, then 


cos nv +i sin nx =e!" =(e!”)" =(cos x+i sin x)”. 


Calculating the right-hand side and comparing the real and the imaginary parts, one finds 
that 


n = 5 mn ah . 
cos NX = cos” «-(3) cos"~2 x sin? «+ (4] cos"—" x sinta—..., 


sin nz=n cos?—! »-(5) cos"-3 x sinte+(5) Coss”? @ sin? vw —... 


! 
Here (;) Steen are binomial coefficients, for 1=k=n. 


5.3. Exponential Functions and Power Functions (Complex- Valued) 
5.3.1. The Functions e* and In z- 


In this section we will consider complex-valued functions f(z), with the com- 
plex plane C, or C,—{0}={w | weC;, w+0} as their domains of definition 
D(f(z)). As usual, z=a2+7y=Rez+iIm z=re'?, where the last expression is the 
representation in polar co-ordinates as given in Subsec. 5.2.4. 


Definition. (a) e*=e%el’, with D(e*)=C;,. 
(b) Inz=Inr+ig, 0=p<2r, with D(In z)=C,— {0}. 


Remark. e”, e!¥, In r for r>0 were introduced previously. 


Riemann surface for In z. One has 


3 Ae aan 
= rel? = rely + 2kxi , 


where k is any integer. The restriction to 0 = p< 2r in the definition of In z appears 
artificial. However, if the definition of In z is extended to m¢ Ry, one obtains a 
many-valued function, which is undesirable. To avoid this dilemma, one intro- 
duces Riemann surfaces. In the case of the function In z this is done as follows (Fig. 
5.19): an infinite number of complex planes C, are laid one on top of another, 
numbered by k=0, 1, —1, 2, —2, ..., slit along their positive real half-axes, and 
the cutting edges of these sheets are cross-connected to one another according 
to the instruction given in Fig. 5.19. The integers indicated in Fig. 5.19 are the 
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sheet indices, and the pairs of eorresponding numbers arranged one above the 
other show from which sheet to which other sheet one has to move when cros- 
sing the slit. Let w be sueh a path, then this means that, when moving from A to 
B, one changes from sheet No. 0 to sheet No. 1 or, generally, from sheet No. & to 
sheet No. &+1. Conversely, when moving from B to A, one ehanges, for exam- 
ple, from sheet No. 17 to sheet No. 16. Now the convention is made that in the 
k-th sheet z shall be represented by 


2=re*, “Where 2ir=p<2 (k41)r. 


On this Riemann surface it is now possible to represent 
In z=In r+ig, pe, r>0, 


as a single-valued function (0 does not belong to the surface). If one moves 
continuously along the Riemann surface, the (single-valued) function varies 
eontinuously in z. 


Theorem. (a) For z€C, and w€C,, ee’=e?*” . 
(b) In z ts the “inverse function” of e’ in the sense that e™=z for all z+0, 
independently of the particular sheet selected. 


5.3.2. The Functions 2”, Riemann Surfaces 


Till now the funetion x” has been defined only for x>0 and y€ Ry. 


Definition. Let z and w be any two complex numbers, and let z+0. On the Riemann 
surface for In z one sets 2” = en, 


Remark 1. If w is fixed, then the domain of definition of the function f(z) =2” is the Rie- 
mann surface for In z. If (z”); is the value of the function on the k-th sheet, then using 
0=9<2s and Inz=Inr-+ip one obtains 


(=), =e¥lnr+wigt wilkr a, elwltr | . (1) 


Remark 2. Let <=x>0 and w=y€ Rj, then one obtains (z”),=e4!"* =a¥. This justifies the 
above definition. 


0; +, 


Lemma. Let w,; and w, be any two complex numbers, then 2\2=z on the 


Riemann surface for In z. 
Remark 3. One finds (Fig. 5.20) 


ii (+ 2kr) = 5 ~2kr 


paelsi_e =e 


Consequently. on the 0-th sheet one obtains ie 2, an interesting formula! 


Remark 4. If w =~ is real, then |c!#2**|=1, Then it follows from (1) that the values (2%), lie 
on a circle of radius |(2%)y, (Fig. 5.21). 
(Ze), 


Fig. 5.20 
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enn 8 : # : : . 
Remark 5. Of special interest is the case where ere is rational, p being an integer and qa 


natural number, p and q coprime. If J and & are two integers with /=k-+ gn, n integer, then 


a # Dp 
(1) implies that (2%), = (2" \z4nq- Hence 2 7 has only q different values, which may be de- 
Dp dD Pp “ 
noted by (2% )p, (22). «+» (2%)g—1- Consequently, to represent 2 7 asa single-valued function 


requires not the whole Riemann surface for In z, but only q sheets of it, e.g., the sheets with 
the indices 0, 1, ..., g—1 (Fig. 5.22). From sheet No. g—1 one returns to sheet No. 0, as is 
shown by the imestinarettonn indicated in this figure. The surface of existence (domain of defini- 


if — 
tion) of 2 =z? then consists of 2 sheets; for example one has (Vi )p=1 and (V1),=e™=—-1 
(Fig. 5.23). 
pe ee ig. 5.22 His. 5.23 
0 7-9-2 g-T 0 


5.3.3. Roots of Unity, Fundamental Theorem of Algebra 
a 
Definition. Let n be a natural number, then the complex numbers 1" are called the 


n-th roots of unity. 
oie 


hi 
Lemma. There exist n different n-th roots of unity, namely e” , where k=0, 
n—1. 


Remark1. The lemma is an immediate consequence of the considerations given in Sub- 
sec. 5.3.2. The roots of unity define the corners of a regular polygon. 


Theorem 1. The polynomial 2” —1 has exactly n different zeros in C, (v.e., complex 
numbers z which satisfy 2” —1=0). These zeros are the n-th roots of unity (Fig. 5.24). 


ee Bie ot 


Remark 2. The theorem is a consequence of the above considerations. Now the question 


n 
arises whether the situation is similar for arbitrary complex polynomials of degree n, 2, a;2’, 
n 730 
a; complex. We may suppose that a, +0. Here 2, is called a zero if 2 ajz = 0. 
j=0 


nr 


Theorem 2 (fundamental theorem of algebra). >) ajz , a; complex, dn +0, has exactly 
7 =0 
n complex zeros. Let 21, ..., 2, be these zeros, then 


nN 
a Ge —wy (2 —2,) (22) le — 2) - (1) 

fea 
Remark 3. It may happen that some (or even all) of the zeros coincide. If, for example, 
2 =2o=...=2,, but 2,+2, for k=r+1,...,n, then z, is said to be a zero of multiplicity (or 


order) r. If >} ay has exactly! different zeros wy, ..., w with the multiplicities nj, ..., nj, 
j=0 
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then (1) becomes 


n 


Dag ay (2-204)! (2009)? ... (2-7), (ap £0) - (2) 
7=0 


Here the x,’s are natural pyaar and 2,+2o+...4+2;=n. Hence, taken into account the 


multiplicities, the polynomial > aja, a; complex, a, +0, has exactly n complex zeros. 

a) 
Remark 4. The proof of the theorem is relatively easy if it is supposed that the polynomial 
has at least one complex zero. The best way to verify this assumption i is in terms of complex 


function theory. cf. Chap. 15. We shall return to this point in Subsce. 15.3.6. and in Re- 
MNES Mae, 


Theorem 3. Let a;, 7=0, ..., n, be real numbers, and let a, +0. 
nh 


" (a) Tf wis a zero of multiplicity m of the polynomial >; ajz/, then w is likewise a 
zero of the same multiplicity. iat 

(b) If wy, .., ur are the different real zeros with the multiplicities ny, ..., rs 
and if V4, U4, Vo, bg, ..., Ue, By are the different BeOS complex zeros (i.e., Im v, +0) 


with the multiplicities m,, ..., mg for the polynomial S az), then 
a 
n 


Dae —ane 1) 8 a) ea —Ree) Him o,)2) (3) 
j=o 


. [(2—Re vg)2+ (Im 2,)2]"* . 


Remark 5. If z=~ is real, then (3) is a purely real decomposition, which is desirable for real 
polynomials. 


sede Power Series 


5.4.1. Radius of Convergence 


co 


Let us consider power series P(z) = 2 a; (z—z9)/ defined on the complex plane 
joo 

C,. Here the coefficients a; are complex numbers, zo€C; is fixed and z€C, is 

variable. Trivially, the series converges for z=z). For what other values of z 

does the series converge on C,? Of special interest is absolute convergence, i.e., 


oo 


~ las] lz -zgl?<oo. (1) 


Let {c;}729 be a sequence of real numbes, the lim c; (apper limit) denotes the 
largest, ‘and lim c; (lower limit) denotes the smallest accumulation point of the 
numbers c;. With a; denoting the coefficients of the above power series, one sets 
if 
R= (lim |a;|’)~! (radius of convergence), (2) 
i 
where the convention is made that R= > for lim |a;,|7 
i 

lim |a,| 7 = 0. 


=0 and R=0 for 
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Theorem. For |z—z |<, the series S aj (2-2)? converges absolutely. It does 
7=0 


not converge absolutely for |z—zy|> R. 


Remark 1. The theorem justifies the term “radius of convergence” for &. Whether or not 
(1) is true for |z —z9| = R cannot be stated gencrally. Either of the two cases may occur. 


Remark 2 (examples of power series and associated radii of convergence). 


co 


(2) emer lamer ol (bp) 2) 7 andl itt. 
7=0 i 
(cl > jf and k=. 


ws. 
—_ 


5.4.2. Addition and Multiplication of Power Series 


Let > a,j(z— 2%)? and 2 b;(z— zo)! be two power series with the radii of conver- 
j=v0 j= 
ence f, and fi), eapettvely 


Theorem. (a) /f A and uw are complex numbers, and if R is the radius of convergence 
of >, (Aaj + mb; (2-29), then R=min (Ry, Rp). 
f=0 


(b) If R is the radius of convergence of the serves 


2, ty (2-2) y| |= 2, 0; (z— 9) y|- 2a (c4ybj + 4b; —1 + +. +.45bp) (2-20), 


j=0 g=0 


then R2=min (Lg, Lp). 


5.4.3. Differentiation of Function Sequences and Power Series 


In this subsection we consider real functions f€C[a, b] with —o<a<b<-—, 
As previously, f’ €C[a, b] means that f is continuously differentiable on (a, b) and 
that /’ has a limit on the right at « and a limit on the left at b. The symbol > 
denotes uniform convergence on [a, 6]. 


Theorem 1. For j=1, 2,3, ..., suppose that f;¢C[a, b] and f;€C[a, bj, and that 
fi>f and f;>g. Then f€C[a, b] and g€C{a, b]. Further, f ts differentiable on (a, b), 
“aud | =. 


Theorem 2. For J=1, 2,3,..., suppose that he Cla, b] and f; ¢C[a, b]. Further 
let Za f(x) and 2 f;(x) converge uniformly. Then 2 f;(x) ts differentiable on (a,b), 
= j= 


and 


a t 
( Bs pa) = 3 =D fi(x) (term-by-term differentiation). 
j=l j=l 
Remark 1. We regard real power series, > a; (% — wy)’, a; real, as cut-outs of complex power 
: a! 
series, ft is again the radius of convergence, which now means absolute convergence on the 
interval (%— RB, a+ R) (Fig. 5.25). 
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Theorem 3. Let f(x) = >) aj (wx — 2%), a; real, with R>0, then f(x) hus derivatives wp 


j=0 
to arbitrary order on the interval (xy— R, x9 + R), and 
fO(x) = > FF GRD @ (2@—a9)'-# (1) 
j=k 


(term-by-term differentiation) for k=1,2,3,..., where all of the above series 
converge absolutely on (x y— R, xg + &). 


‘ 


Remark 2. There arises the question whether the converse of the last theorem is also true: 
consider a function which has derivatives up to arbitrary order on (%—R, 29+). Can 
every function with this property be expanded in a power series? In general this is not true. 


5.4.4. Taylor Series 


The question put at the end of Subsec. 5.4.3. shall now be investigated more 
closely. Setting r=2x, in Eq. (5.4.3./1), one obtains {f(a 9) =a,k!, and hence 


<  ) 
2 TH 


This expression is called a Taylor series. The question put at the end of Subsec. 
5.4.3. now reads: when does the Taylor series converge, and when does it 
converge towards f(x) ¢ 


(x—2p)* (setting O!=1). (1) 


Theorem 1. Let —~o<ag<x,;<-. Further let k=0, 1, 2, ... Suppose that the func- 
tion f has continuous derivatives up to the order (k+1) on (2x9, 2), where 
PVC ntl Gmg =O, ko ld nen ; 
ae van. * 

f2)= EY (eae f ah My) dy (2) 
=) : ae 
for all x with xy<xr~<x,. Further there exists a function 6=0(x) with 02881, 
such that 


ketG ip Gr, Wore 
fle) = DEY Co — yy SE $0 (20 49 a0) (3) 


for all x with xry<x<x}. 


Remark 1. The last summand in either of the formulas (2) and (3) above is ealled remainder. 
Thus the question for the validity of (1) reduees to the question of whether or not these re- 
mainders approach zero as k > (provided that f has derivatives up to arbitrary order). 


Remark 2. If { has continuous derivatives up to the order k+1 ona neighbourhood (xp —é, 
Xo +e), €>0, then (2) and (3) hold for x€(xp—€, % +e), where in (2) for <2 one has to te- 


F x Xo 
place f by—f .- 
vo x 
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Theorem 2. Let —o<x,;<a,<o. Let the ae f(x) Be derivatives up to 
arbitrary order on (x1, X9), and let rE (x, Xo) and xy € (a4, Xo). Lf 


SS rs ee Oa ae ees | (4) 
ta ee! 


where I(x, x)= (x, z) for ry< 2 and I(xo, x)= (x, %) for r= x, then f(x) can be 
represented by (1). 


Remark 3. This theorem is an immediate consequenee of (3). 


5.4.5. Examples of, and Counter-Examples to Taylor Series 


Theorem 1. On R,, e®, sin x and cos x can be expanded in absolutely convergent 
power series. For x€ hy, 
1 oe eee ge ge 


r= 2 Se ain eae a teat sun gOS Op es SE oh 


Remark 1. In terms of Theorem 5.4.4/2, one has a=0, and (5.4.4/4) applies. 


Theorem 2. Let «=0, then 
ace ; _ Ine . 
lim —=c, lim e-*2*=0, lim ——=0, end lim x? |Inal= 
Loo yr Loco Bro wy 210 


Remark 2. The first proposition follows from Theorem 1; the other propositions are derived 
from the first one. 


Theorem 3 (counter-example). The function 
i 


fee)= {5 i er Fig. (5.26) 


has derivatives up to arbitrary order on Ry, but cannot, at the point 0, be expanded in 
a Taylor series with a positive radius of convergence. 


e 
Remark 3. For the f(z) of Theorem 3, f(a) = es Ca: ) for «>0, where P;(a) is a poly- 


nomial in x. Theorem 2 then shows that f has denneiees up to arbitrary order at the point 0 


and that f*)(0) =0. This implies however that (5.4.4/1) with a)=0 eannot be true in a neigh- 
bourhood of 0. 


5.4.6. Power Series for e*, Analytie Funetions 


It turns out that the considerations on real functions, as described in the 
preceding subsections, can be extended to complex functions. In Chap. 15 we 
shall deal with the subject systematically. Some statements can however be 
made already now. 
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Definition. Let G be an open domuin in the complex plane C,. A complex-valued 
function f(z) 7s called analytic on G if at every point 29€G4 it is possible to expand 


f(z) ar an absolutely convergent power series >, aj (z—29) with a positive rudius 
of convergence. j=0 


Theorem. e* 7s an analytic function on G=C,, and 


2 
Sn 


forall z€C,. 


Remark. The other complex functions considered previously are analytic, too, provided that 
the above definition is slightly modified. Thus In z and 2” are analytic on the Riemann sur- 
face for In 2. 


5.4.7. Irrationality of e 


A real number which is not rational is called irrational. 


Theorem. e 7s an trrational number. 


1 
Remark. The proof is based on the representation e= > mm 
kau 


6. Banach Spaces 


Oa Definitions and Examples 


6.1.1.  Definitious 

The concept of a vector space is well known from analytical geometry: it is a 
nonvoid set M in which two operations are defined, namely addition of arbi- 
trary elements of M and multiplication of elements of M by real or complex 
numbers. The elements of M must have the following properties with respect 
to addition: 


Ty yee (commutativity) , 
EOL (existence of a neutral element 0 € M), 
(ce+y)+z=a+(y+z2) (associativity) . 

Here x, y and z are arbitrary elements of M. Further for every pair of elements 


x€M and y¢éM there must exist a unique clement z¢€M such that r+z=y. 
With respect to multiplication by real or complex numbers, the elements of M 
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must have the following properties: 

Maty)=Axtay, (Atp)e=Art px, 

re A) aa 
Here «<€M and yéM, and 4 and wp are real or complex numbers. If only real 
numbers are admitted for the multiplication, then M is called a real vector 
space. If complex numbers are admitted for the multiplication, M is called a 
complex vector space. From now on we shall write 0 instead of o. Further we 
write z= —y if y+2z=0. 
Definition 1. A (real or complex) normed space ts a (real or complex) vector space M 
on which a function is defined which assigns to every xe M a non-negative number 
|x| having the properties 

1. |x|] =0, where ||x|| =0 af and only tf x=0, 

2 axl, 

3. ly +a] El|lal]|+llyl| (riangle inequality). 
Here x€ M, y€M, and A is real or complex. ||x|| vs called the norm of x. 
Lemma. With o(x, y)=||x—y\|, a normed space becomes a metric space. 


Remark. The lemma is a consequence of Def. 1.3.3/1. From now on we shall regard normed 
spaces a8 special metric spaces. 


Definition 2. A (real or complex) Banach space is a (real or complex) normed space 
which (regarded as u metric space) is complete. 


6.1.2. Examples 
The space R,. With lel a (ail) tor @—(2))2) nq) ttn ie & Banach space, ci. 


Lemma 2.5.1/2. Here fine. adaiien Ole = (a) 2, dy (ee) ee Well 
as the multiplication by real numbers 4 are defined in a natural way: ¢+y= 
= (Cane Tie Oday) In + Yn)» Ax = (Ax, O00 Arn). 


The space C,. If one considers complex sequences x= (ay, ..., ¢,) and multipli- 
cation by complex numbers /, then C, is a Banach space with respect to the 
norm ||z|| defined above. (This is a slight modification to Subsec. 1.3.2.) 


The space C[a, b]. With |\flea,o) = ay If(x)l, the space Cf{a, b] defined in Sub- 
ela 


sec. 2.5.2. is a Banach space; cf. Tocco 2.5.2. Here the addition of functions 
and the multiplication by real numbers are defined in a natural way: 


(f+ g(x) =fx)+ g(a), (Af)(&) = Af) 


6.2. Spaces of Type l, 

6.2.1. Inequalities 

Lemma. For x>0, y>0, 1=<p<- and 1/p+1/p’ =1, the following relation holds: 
xy SxPip+y? |p’. 

Remark 1. p’, with 1/p+1/p’ =1. is also called the conjugate number of p. 


Theorem. Let x1, ..., % and Yj, «5 Yn be complex numbers, and let 1<p<- and 
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i/p + 1/p’ =1, then 


n = ne tip fn Le 
Sone 7 lxjl yl s p in?) 2 nl") (1) 
= 


j=! a j=l 
and 
n {/p n {/p n t/p 
eo Inj) 2(2 sl?) +(3 wl") (2) 


Remark 2. (1) is called Hélder’s inequality, (2) is called Minkowski’s inequality. 


Pf 
6.2.2. The Spaces Uj.rs Ub,c, lpr, ad lp,c 


Let us consider vector spaces whose elements are finite or infinite sequences 
x= (a4, %, -..) of real or complex numbers. The addition of r=(24, %, ...) and 
Y=(Y4; Yr, ---) is defined by x+y=(x1y+Yp %2+Yy, -.-), and the multiplication by 
real or complex numbers / is defined by Ar= (Ax, Ama, «..). 


Definition 1. (a) Let n be a natural number and let l= p<, then lpr (or Upc) ts 
the real (or complex) vector space of the real (or complex) sequences x= (X4, +, Xn); 


n i/p 
with ||x||)n= ( = int . 
Pp al 
(b) Let l= p<e, then lp (or lp.¢) ts the real (or complex) vector space of the 


oo 1/p 
real (or complex) sequences x= (24, X9, ..) wrth |lal., = ( = sl?) < co, 
j=l 


Remark 1. According to Subsecs. 1.3.1. and 6.1.2., R,=U,z and Cy=Ti\¢. 
Lemma. Let n be a natural number, and 1=p<~, then, forall xélic ; 


Nell nS nla ered (lpn lla - 
il P Pp 1 
Remark 2. Letting n ~~ in the second inequality, one obtains lel, S|la\lr, for all x€ly ¢. 


Definition 2. Two norms |\z\|, and |lallp in a vector space M are said to be equivalent 
if there exist two positive numbers c, and cy such that 


€4\z||,Sllallo Sealla|l 
for all xe M. 


Remark 3. The two normed spaces of Def. 2 are regarded as special metric spaces with the 
metrics 0,(x, y)=|z—ylly, J=1, 2. Cauchy sequences with respect to g, are also Cauchy se- 
quences with respect to 99,and vice versa. An analogous statement can be made for conver- 
gent sequences. Thus the norms ||z||; and ||z||) are fully equivalent with respect to convergence 
behaviour. 


Remark 4. The above lemma implies that all norms Ilell)n with 1=p<- are equivalent in 
Pp 


It Rr or in Vie: 
Theorem. (a) If n is a natural number and 1=p~<-, then lyn ws a real Banach 
space. If 1=q<, then llall,n is an equivalent norm in ly pr 

p 


(b) [fnisa natural number and 1=p~<-=, then [h.¢ is a complex Banach space. 


If 1sq<~, then ||al|,, vs an equivalent norm 11a 
q 


(ce) If 1=p<~, then ly p is a real, and ly¢ vs a complex Banach space. 
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Convention. A, can be normed using one of the equivalent norms ||2'|, n From 
now on we make the convention to set 


12 
Iz lle, = (> joie) , therefore h,=li pz. 
j=l 
An analogous convention is made for C,. Further we write 7} instead of J p or 


Ic because it will be clear from the context whether a real or a complex space 
is concerned. By analogy we write /, instead of 1, p or 1,,¢. 


re Integrai Calculus in R, (continued) 


tolls Classes of Integrable Functions 

7.L.1. General Rules (Integration by Parts, Change of the Variable) 

Theorem 1. Let —~o<a<b<o. Iff, f’,gandq’ are elements of Cla, 6], then 
F iy'de=/) 90) —Hla) le) foe. 


Remark 1. This rule is called integration by parts. It can sometimes be used to reduce un- 
known integrals to known ones. To give an example, let 0<a<b<~, and let « be real, 
a+ —1, then 


b b b 
pathy! yeti gee 
i 2* In bie If Ina da= ihn gel — / —dz. 
ee ap 1 a+] jl spill & 
a a a 


Here one sets h(x 2/2 =h(b) —h(a). The integral on the right-hand side is already known: 


b (tan 
| ee EE |b, for B+ -1 (1) 
@ inners itor i= il 


Change of the Variable. Let —°0<a<b<~, and let p€OCfa, 6] and »’€Ofa, b] 
with g’(x)>0 for x€[a, b]. Then ¢ is strictly increasing (Fig. 7.1). Let p=o7! 
be the inverse function. Then y€C[o(a), ¢(b)] and wy’ €C[¢(a), p(b)] (cf. Subsec. 
ela. 


“et 
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Theorem 2. Let - b], then g(x) =fly(x)) €Cl[y(a), (db), and 


b 
Uae =i g(x) p(x) dar. 


g(a) 


Remark 2. If pe Cla, bl, pe Cla, _ and ieee ae on [a, 6], then @ is strictly decreasing. Then 
it follows from /€C[a, b] that g(x) =f(y(x)) €C[@(b). p(a)] and 


g(a) 
Ft ae Oe a eee 
¢(b) gb) 


7.1.2. Integration of Rational Funetions, Partial Fraction Decomposition 


Quotients of polynomials are called rational functions, 


n 
Ab 


Ss dp 
a et 
E m 2 
Q(x) > bx! 
Lau) 


where, for the time being, a; and b; are assumed to be arbitrary complex num- 
bers wieneu.,-- Ol urther leti7— 0, 1,2)... and m=0, 1,2, .... Then, according 
tO (o.onnve |, 

Oa) = Cao.) ! @—a) 2... e— aj), 
where «,,..., %; are the different ne zeros of Q(x), with the multiplicities 
Wis, «<5, 
Theorem (decomposition into partial fractions). (a) There exists a unique repre- 
sentation of R(x) as 


R(x) = 


xe ky, 


My 


oat Se (1) 


atont (x = 
where p(x) ts a polynomial and the Ay» are complex numbers. 
(b) If all the coefficients a, and by of R(x) are i te Gig 11. 
(conjugate complex zeros of Q(x)), then Ar»=Asy for v=1, ..., my. 
Remark 1. (1) is meaningful for all e+, with «=1, ...,j. As regards proposition (b), refer 
to Theorem 5.3.3/3. In partieular one has m,=ms. Further, in this case it follows that d,,, is 
real if «, is real (keeping the assumptions of (b)). 


Remark 2. p(z) and A, ean be determined explicitly, although the procedure is a_ bit 
tedious when multiple zeros are involved. If we multiply (1) by (w—a,)™ and then let 
“%—-%, we obtain 


Aj m,= lim R(x) (2—a,)™ 
Tay 
where the right-hand side ean be calculated by ?Hospital’s rule.Thereafter the method is 


, ‘ : ; Atm: 
applied to the new (less complicate) rationa] function F(x) * eae 


can be calculated step by step. Finally the expression is reduced to the polynomial p(z). 


. In this way A,, 


Integration of R(a). In the integration over intervals [«, b], we have restricted 
ourselves to real functions, so that now, too, we assume that all the coefficients 
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a; and b; in R(x) are real. Here it is further assumed that [a, b] does not contain 
any zero of Q(x). Moreover we try to transform (1) into a purely real represen- 
tation. If x, is real, then it follows from Remark 1 that A,, is real. If a and A 
are complex numbers, with hn «+0, then 


A A cj (w— Re «) +d; 


k 
Gaye eae 1) + 2 eRe ay? im a 


(2) 


where q(x) is a polynomial and all‘coefficients on the right-hand side of (2) are real. 
Part (b) of the theorem now shows that the terms in (1) can be combined in 
pairs, which gives a purely real representation of R(x). Elementary changes of 
the variable, of the form y=ar+b, show that integration of A(x) can be reduced 


1 
to integration of x’, @sip and ame over suitable intervals. Here 7 is an 
integer and / isa natural number. Integrals of x have been dealt with in (7.1.1/1), 
: “4 1 | 
oo eee ae : ae : ; 
J Pegi o =| 3 (in G4 DY dr=s in (+I) Ie, 
c ¢€ 
d ad 1 
: s : 
Se dr= ————— 2 1 k+l) dx 
| G24" | yer et) es 
1 . 
= — a (x? 4 1)-F +! 2 oe 
2(k-1) Cea ) as kh ao: > 
d 
| = =arctan x |? (Theorem 5.2.3.) . 


a 


Finally, the still missing integrals can be calculated by iteration, using the 
formula 


ee seb eae =! (Gee Py i 


c 


d d 

k-1 * 9k -—3 Ae d 

—— dr= | ———-—— dr + —_— |... k=?2,8,... 
if | ( 


To sum up, it is seen that the integration of real rational functions can be reduced 
to that of a few basic types, which in their turn can be integrated by elementary 
rules. 


7.1.3. Integration of R(cos x, sin x) . 


Let us consider rational functions of cos x and sin x, 


oi 
D toy cos eau ¥ 
I kl=0 
R(cos x, sin x) =-=——___——_ 
Da, Come See 
PS 


where «a, and b,, are real numbers. As (cos x, sin x) is periodic with the 
period 2x, the integration can be restricted to intervals [a, b]<(—7z, x) which 
do not contain any zeros of the denominator. Changing the variable by 


Hedi gle 7.1. Classes of Integrable Functions 51 


vo 
y= (x) =tan >, in the sense of Theorem 7.1.1/2 (Fig. 7.2), gives 


b tan = tan ue 
‘ ; p 4) 2dr ae 
R(cos x, sin x) dr= R aie y a | Ri d 
tan < tan = 


where R(y) is a rational function of y. This reduces the problem to the integra- 
tion of rational functions. Using the Moivre formulas of Subsec. 5.2.4., rational 
trigonometric functions of the type 


R(cos nx, sin nx, cos (n—1) x, sin (n—1) 2, ..., cos x, sin x) 


can be reduced to rational trigonometric functions of cos x and sin x, and hence 
be integrated by elementary rules. 


| 
| 
| 
| 
| 
ra 


=e 


7.1.4. Integration of R(e7), R(x, Vx?—1), and R(x, Vx? +1) 


We consider rational functions of one variable (as in Subsec. 7.1.2.) or of two 
variables (as in Subsec. 7.1.3.). 


Integration of R(e*). Substituting y= ¢(x)=e* reduces the problem to the in- 
tegration of rational functions of x in the sense of Subsec. 7.1.2. 


Integration of R (a, ¥x2—1) (Fig. 7.3). Here it must be assumed that |2]=1, 
because we consider only real functions. Let x>1. Substituting x=cosh y= 

eae ae : 
= aes (hyperbolic cosine) reduces the problem to the integration of rational 


functions of e*. 


gees ae 
5 (hyperbolic sine) 


=e : : € 
Integration of R (w, Vx?+1). Substituting x=sinh y=—>— 


reduces the problem to the integration of rational functions of e*. 


ao 7. Integral Calculus in R, Peds 


7.1.5. Integration of R(w, V1 — x?) 


It must be assumed that |a|=1. Substituting r=sin y reduces the problem 
to the integration of rational functions of cos x and sin x in the sense of Subsec. 
7.1.3. (Fig. 7.4). 


ax +b\" 
7.1.6. Integration of R @ nat 
cx +d 


Let 2 be a natural number. Further let a, 6, ¢ and d be real numbers, with 


a+b ee 
ad—bce+0. Let Po >0O on the integration interval. If c=0 (we can then 


Coster il 
assume that d=1), then substituting y=(ar+b)” reduces the problem to the 
integration of rational functions in the sense of Subsec. 7.1.2. (Fig. 7.5). If 
at+b a  ad—be 
i) 8 Ae 
gration interval is included in a monotony interval for this function, then substi- 


tuting poe ee 
cx +d 


c+0, then 


gives the curve shown in Fig. 7.6. If the inte- 


reduces the problem to the integration of rational functions 


in the sense of Subsec. 7.1.2. 


ax tb 


~) 
bo 


Improper Integrals 
7.2.1. Types of lnproper Integrals, Examples 


There is an extensive theory of improper Riemann integrals. Here we shall 
restrict ourselves to some remarks and a few examples. Let —o<a<b<o., 
Functions integrable over [a, b] are bounded (according to Definition 3.2.1). 
The first question is whether it is possible to define integrals of unbounded func- 
tions. Further it is of interest whether the integration intervals can be un- 


b  « os 
bounded, ie., f ,f or f . Let us consider two typical cases. 


—co (t — co 


re 7.2. Improper Integrals iB: 


Ist case. For any sufficiently small positive «, let f(x) be integrable over the 
b 


L{aX 


b 
interval [a +e, bj. Hlim f f(y) dy exists, then this value is denoted by f f(x) da, 
and called improper (Riemann) integral (Fig. 7.7). u 


2nd ease. For every number N, N=a, let f(x) be integrable over the interval 


oss Lo} 

[a, N’]. If lim f f(y) dy exists, then this value is denoted by f f(x) dz, and also 
Noro a a 

called improper (Riemann) integral. 


Remark. Of course these two procedures can be combined or applied repeatedly. Tn this 
way it is possible to integrate some kinds of unbounded functions over bounded or un- 
bounded intervals. 


sin x 


Example 1. is contmuous on [0, NV]. Let 7; denote the areas indicated in 


Fig. 7.8, then one obtains 


cG. @ 
] dof Sa ea o 
0 Y kere 


sin ¥ 
dy , 
Gg 


where it has been assumed that fn<x=(k+1) x. As J; is a monotonically de- 


© 


sin 4 
creasing sequence of positive numbers, it follows that i Z dy exists. We shall 
calculate the value of this integral in Subsec. 15.5.1. § Y 


SOF ig. 7.8 
(k+1)30 


Example 2. [f « is a real number, then x” is continuous on every interval [e, 1] 
with O<e<1. Then 


1 J—gttt 
- — for «+-1, 
{ vay= a+1 
x ie: for a=—1. 


i 
From this it follows that lim f y*dy exists if and only if «> —1. For «20 one 
ae i) Ke ; ; 
has an integral in the proper sense, for 0>2=> —1 one has an improper integral. 


a 
Example 3. Similarly it is seen that lim f y’dy exists rf and only ifa<—1. 
L—~o 1 
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7.2.2. Integral Convergence Test for Series, Euler-Mascheroni Constant 


Given the serics 5 aj, aj>0. The question to be answered is whether this 
j=l 
series is convergent or oe In the way indicated in Fig. 7.9, a step func- 
tion h(x) with the steps a4, a, ... is constructed. 


oo 


Lemma. (a) If f(x) 2A(x Vie ys ) dx<oo, then >) aj<o. 
°° j= 1 
b) If OS q(x) Sh(x ) and F gle) )dr=o, then >) aj=oo. 


j=l 


Remark'1. The lemma is an implication of f A(x) da= > a;. We write “=” if the inte- 
grals or series concerned do not converge. 9 ja 


Theorem. There exists a positive number C such that 
: ee il 
lim. (2, Zin n| = 
Remark 2. C is called Euler-Mascheroni ee The theorem is proved by referring to 
the above lemma. One sets g(x) = : ag Then J g(x) dx =In (n +1), and it can be shown that 
n 


lim (35 ——In(n+1 i 2! 


N— oo 


converges (Fig. 7.10). 


7.2.3. The T-Funetion 


Definition. For «+0, 1( =f ee di, 
Remark 1. P(x) is a convergent improper integral. 


Theorem. I(x) 7s a convex, positive function having derivatives up to arbitrary 
order on (O, cc) (Fig. 7.11). 


Fig. 7.11 


Noll ; 8.1. Partial Derivatives 5 


OL 


The following relations hold for I(x). 
[(x)+02 as 2->o and as x0, 
ay i (eed) for x>0 and 
I’(n)=(n—-1)! Oa De. se 


Remark 2. The function / (2) is of great importanee. As I(n) =(n —1)! for natural numbers n, 
it is also written [(@) =(%—1)! ae ae), 


S. Differential Calculus in R,, 


S.1. Partial Derivatives 
8.1.1. Definition 


As defined in Subsec. 6.2.2., R, is normed by |a|=Vx?+ ... +22, where 
t= (2X4, ..., Tn) €An, x; real. We consider real-valued functions f(x) =/(x,, ..., tn), 
whose domains of definition, D(f), are open sets in Rp. 


Definition 1. /(x) has a partial derivative with respect to x; at the point x9°= 
=(A5e * - tr) E Dif) uf 

P(x5) By) in os ea ay 19 Tj; ae +945 a) 

a aan es 

fixed 

ws differentiable at the point a op AD he 


Remark 1. This means that n—1 co-ordinates are Pxen and @(a;) is considered real- 
valued function defined on the z;-interval [a, 6], with x} %E(a, b) (Fig. 8.1). If g(x x?) exists, it 


is written (2; att | (x°). Consequently, 
oo 
Dy — tiny Leese hte BI te oe a) — 
= (Yk, —— SS ; 
Ox; a h—-0 h 


Remark 2. Examples can casily be stated. 


a(n 
oe ( 2 apey |= a,5 wa, real , 
i \k= : 


tg ; by tin akn 
ces CS Ob pak pet 0 x") = > Reilly keg ® 4 mee By ena 
j 


where >) denotes a finite sum (partial derivatives of polynomials), and the Oh jk, WE real. 


6* 
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SO ee 


Remark 3. The following exam ple is of fundamental importance. 
Quy 


f(x, =e erry? for (x, y)+(0, 0), 
0 


igre C=y=0 


xe R,, ye Ry. 


a and ar exist for (a, y)+(0,0). From /(0, y)=/(z, 0)=0 it follows, however, that 
le y : ° 
2 (0, 0) =o (0, 0) =0 exist as well. On the other hand, (x, y) is not continuous at the point 
(0,0). This can We Soe if one introduces polar co-ordinates «=r cos p, y=r sin g, which 
shows that f(a, y) =2 sin y cos y=sin 29 Is independent of r. 


& 


7) 
Lemma. /f f(x) has all tts partial derivatives = (yee ae (x) in a neighbourhood 
vy un 


<oo forj=1, ..., n, then fis continucus 


0 
U of the point x®€ D(f), and uf sup | 2 / (x) 
0, dx; 
at the point x 


Remark 4. The example of Remark 3 shows that the boundedness of the partial derivatives 


vA] q 
in a neighbourhood of 2” is indispensable. In this exorivle, a and ay turn out to be un- 
bounded in a neighbourhood of (0, 0). su y 
aan ; ee epee : of : 
Definition 2. f(x) has the partial derivative of second order ae at the point 
ne ee eck hae 
x€D(f) tf the first partial derivative LL exists in a neighbourhood of x® and if 
, v4) ; . k ; 
the partial derwative —— (=) exists at the point x. 
Ox; OX- 


Remark 5. By iteration, one may then define 


Ba = PB) os 
OL gad DxXp,  OFk yy 0&4, 4 "ico OXp oi 


8.1.2.  Commutativity of Partial Derivatives 


Theorem. If f(x) is continuous in a neighbourhood U of x°€D(f)c Rn, and if the 
of Of 0} 
d 


an exist and are also continuous in U, then 
Ox; ? Bay Lh OXjOXE 


partial derivatives — 


or i eae 
exists in U, DD. and 
On ,0%; 
22 02 
— (x)= —— (r), «EU 
O202 he 


Remark. If /(z) has all its partial derivatives up to the order m in a neighbourhood U, and 
if all these derivatives are continuous in U, then the above theorem can be applied itera- 
tively. This gives 

a or] 


GMO. onc (02 5S Xn 
kim ky Oe, eee Ox, 


=D*f. 


Here 7 =(a), «+; ) is a multiple index. «; indicates how many of the k, are equal to J; 


=) 222: one has |e) = 2, a,—70, 
j=l 


8.1.4, 8.1. Partial Derivatives OF 
8.1.3. Taylor Polynomials 


We shall now generalize Theorem 5.4.4/1 to the n-dimensional case. Hence, 
what is desired are expansions of real-valued functions f(x) at the point x9 € D(f) c 
c Ry. Without loss of generality, we assume that x9=0. Let a= (a, ..., an) again 


n 
be a multiple index with aj;=0, 1, 2, ... and [|= >) aj. Further let a!=a,!, ..., an! 
a a . i=! 
(with O!=1) and a=! ... x”. Finally let O(|x|') denote a function defined in a 
neighbourhood of 0, which can there be estimated by |0(|z|)| =Clz|’, where C is 
independent of |.r]. 


Theorem. Let f(x) be (m+ 1) times continuously differentiable in a neighbourhood U 
of the origin (2.e., let all the derivatives (D*f\(x) with |al=m-+1 exist and be contin- 
uous In a neighbourhood of 0), with m=0, 1, 2, ... 

(a) For xe U, 


fxn= 2 = (Df) (0) x* +O(lal™+!) (Taylor polynomial) . (1) 
jals=m&- 
(b) Tf 
ole Pe agar? + O( lz" +") 


1 
for x€U, then d= Cee (O}s 
eh 


Remark. (b) says that the Taylor polynomial of (a) is the best possible approximation to 
f(x) at the point 0 by a polynomial of degree m. 

Remark 2. By analogy with (5.4.4/3), the remainder R,, =O(|a|"*") in (1) can be represented 
by 


B 
R(t) Ay DOP a) (2) 


'BJ=m+1 


where the a) are numbers independent of x, with oso? s1. 


8.1.4.  2-Dimensional Power Series 


We shall now generalize Theorem 5.4.4/2: find a representation of f(r) by a 
Taylor series in #, in the neighbourhood of the origin: 


1 
jey= (DN Ov, (1) 
One possible approach is based on Eqs. (8.1.3/1) and (8.1.3/2). If Ry(x)-0 as 
m—co, then f(x) can be represented in the form (1). We shall now describe 
another approach. 


Theorem. Let a=0 and |a,|<a for k=1,...,n. For every multiple index x and 
fixed 24, 0.) Lj—1, Litt, +» Xn, led the (one-dimensional) function D* f(xy, ..., Xj, .+5 
ay) be representable as an absolutely convergent power series in [—a, a] (with 
respect to x;, and with 0 as the centre of expansion). Then f(x) can be repre- 
sented as an absolutely convergent Taylor series (in the sense of (1)) tn the cube 


Q= {x | |aj| <a}. 
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8.1.5. Curves and Surfaces in R,, Chain Rule 


Theorem 1 (chain rule). Suppose that F(a, ..., %) has continuous partial dervv- 
atives of first order tn an open domain U in Ry. Further let oj(t), 7=1, ..., n, be 
functions which are continuously differentiable on the interval (a, b). Let (py(0), +5 
pnlt))€U for té(a, b). Then yt)=F (1 (8), -- Pn(é)) ts continuously differentiable 
on (a, b), and 


VO= Zz (oils lO) tela, d) . 


Curves in Ry. If p(t), ..-, Pa(t) are real-valued continuous functions, then 


e(t)=(91(t), +) Pr), t€(a, b)=L, 
is called a curve in Ry. Here J is an interval. x(é) is called a differentiable curve 
if the components 9;(¢) are differentiable functions on 7. Here the tangent vector 
is of interest, being defined as 


Ue (Ee ND ee) 
h=0 


7 yy 4 ’ t€ (a, b), laj<e 
(er. Hig. 8.2). 


Fig. 8.2 


Lemma. [f x(f) vs a differentiable curve, then 
V(t) =(Gi(t), + Pal) for tE(a, b). 


Remark 1. The line passing through the points a(t) and «(t+h), h+0, is called secant line. 
As h-0O, the secant lines approach the tangent linc, whose direction is given by e(#). 


Surfaces in R,. If F(x) = F (xy, ..., en) is a given real-valued function, then the 
set of all points x¢€ R, where F(x) =0 is called a surface in Ay. If a;, 7=1, ..., n, 


and c are real, then 
oi 
Ia) == >, aa,—c—0 


j=0 


n 
isa planein Ry; >; laj|=0. If r=0, then 
pe 


Fla)= 2 (uj )P—P=0 
isa sphere in &,, with centre v9 and radius r. By analogy with the curves, we look 
for tangent planes of surfaces and vectors orthogonal to them. 
Definition 1. Two vectors x= (ay, ..., %) and y=(Yu, »-» Yn) am Ry are said to be 
n 

orthogonal vf >, xjy;=0. 

j=1 
Remark 2. From analytical geometry it is known that, for n =2 and n=3, two vectors « and 


y are orthogonal if and only if they are (in a graphic, geometrical sense) perpendicular to 
each other (provided that |x| |y|>0). 
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Normal yeetor of a plane. A vector »y=(9), ..., %) is called normal vector of the 
n i 
plane F(x) = >) a,x;-c =0 if p is orthogonal to all lines (or, more strictly speak- 
j=t 
ing, to the directions of all lines) extending within the plane. If x° and w are, 
respectively, a fixed and a variable point in the plane, then all the directions in 


a 
question are given by x—.r° (Fig. 8.4). This implies that >) »; (2j—2/)=0 must 
j=! 


R 
hold. On the other hand it is immediately obvious that >) a;(xj—2}) =0. Hence 
j=! 
(41, .-, @,) is anormal vector. It is easily shown that every other normal vector » 
has the form » = (cay, ..-; Cay), ¢ real, and hence is proportional to (ay, ..., @). Our 
object is now to extend these considerations to surfaces. 


a, 
XL Fig. 8.3 [Ki] Fig. 8.4 


n 
Definition 2. Let x° be a pornt of the surface F(x) =0. Then 2 a; (xj — xf) =, with 
n j= 
> la;|=0, zs called tungent plane (of the surface at the point x) if, in a nerghbour- 
j=l 
hood of x°, 
nN 
ae aj(xj— xf) =O (lx —29|2) (1) 
j= 
Then v=(@q, ---; Mm) ts called normal vector (of the surface at the point x®) (Fig. 8.3). 
Remark 3. Hence, tangent planes are defined by approximation properties. This is the gene- 
ralization of the considerations described in Subsec. 3.1.1. For the following theorem, see 
also Lemma 3.1.1/i. 


Theorem 2. [f F(x) has continuous partial derivatives of first and second order in a 
: ee 
neighbourhood of the point x°€ Ry, and if F(x) =0 and es = (x?) 0, then there 


p= Very 
exists exactly one tangent plane (of the surface F(x) =0 at the point x°). [tis given by 


de ele 
= cee (x) (x; x7) =0 : 


ey 


8.1.6. Geometric Interpretation of the Taylor Polynomial 


Suppose that the function f(xy, ..., tn) has continuous partial derivatives of 
first and second order in an open domain U=/(f) in hy. We represent f as a 
surface in Ry+1 (Fig. 8.5), 

an +4 =f(%1, ont in) ? 
or 
Plt yy es 2040) =u 1 Ga) —Fn41=0- 
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F satisfies the assumptions of Theorem 8.1.5/2, where one has to substitute 
n+ for x. Thus 


-(- kos pone 


Ox, : OX, pie OXn : 


= i) +0 
is the normal vector and 


5 


pee 2) at) on as) aa 


is the associated tangent plane. Consequently (in the sense of Theorem 8.1.5/2 
and Kq. (8.1.5/1)), 


ee f(x oe be! 

yi = f(x + 2 aac; (x°) (2; xj) 
is the best approximation to f at the point 29 by a polynomial of degree one. 
This polynomial coincides, however, with the Taylor polynomial of degree one, as 
given in Subsection 8.1.3. If f has higher derivatives, one obtains corresponding 
geometric interpretations for the higher Taylor polynomials of Eq. (8.1.3/1). 
For m= 2 one obtains the best approximation to f(x) at the point x° by surfaces 
of second order, 1.e., by ellipsoids, hyperboloids ete. 


8.1.7. Directional Derivative 


Let »=(v), ..., %m), Where lvl = Vv? + . $ve=1. 


Definition. Z/ f(x) 7s continuously differentiable in the neighbourhood of a point 
aE Ry, then 


is called directional derivative. 
Remark. Without loss of generality, let 2°9=0. As is well known from analytical geometry, 
rotations in Ff, can be described by 


i 


a QjKYe> Jol, wn, 


23 . . . 
where (@;,)j,4=1 18 a proper orthogonal] matrix, Le., 


i i dor j=l. 
2 ajatie=4p for 9-1. 
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In order that the y,-axis coincides with the y-direction, it is necessary that a,,; =v, (ig. 


nN 
8.6). Setting now f(x, ..., %n) =} ee 2, ORE. id =GYy, - Yn), ONe obtains 
(b=il 


at the point r=y=0. Thus the directional derivative proves to be an ordinary 
partial derivative with respect to y,. 


Seo: Implicit Functions and Theorems of Implicit Functions 
8.2.1. Formulation of the Problem 


In Subsec. 3.1.4. we had discussed inverse functions g(y) = f~!(y) of continuous- 
ly differentiable functions f(x) in the neighbourhood of a point x®¢€f,. The 
decisive requirement was that /f’(7°)+0. Jf this requirement is satisfied, f is 
a one-to-one mapping of a neighbourhood U of x® onto a neighbourhood V of 
yo = f(x°) (Fig. 8.7). Let us now extend these considerations to the case of n 
dimensions. Let y=/(r)=(f,(z), «-., fn(x)) be a vector function, where the /;(z) 
are assumed to befunctions continuous in a neighbourhood U of the point x€ Ry. 
We look for conditions which make y=f/(xr) a one-to-one mapping of U onto a 
neighbourhood V of the point y°=f(xr°) (Fig. 8.8). If the functions f(x) are 
twice continuously differentiable on U, then it follows from Theorem 8.1.3 that 


iene 2 (°) yp) Ota — 292), k= 1, (1) 


(x) y) Fig. 8.8 
From Ree Ae hee it is known that the first two terms on the right-hand 
side of Eq. (1) give a one-to-one (and linear) mapping of f,, onto itself if and only 
; Ofte 
if the determinant of the coefficients = CEN) fice Gaon es cere servi, 1 


} 
different from zero. One may now hope that the “‘perturbation term” O (lar — x|2) 
for small values of |2 — 7! does not alter this property. The above determinant of 
coefficients plays a fundamental role and is therefore given a particular name. 
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Definition. Lf y(x) = f(x) = (f1(a), »s fn(x)) has partial derivatives of furst order, then 


“ 


OG gea) nO eee.) ah fn 


Ve, Gon 


: : : Ome 
vs called functional determinant (or Jacobian determinant). (Fo veh) deno- 
O(a, sees Fp) i,k 


tes the minor determinant obtained by deleting the j-th row and the k-th column in 


4 eee ! 
OY so+9 Yn) (minor of the element ct) 
Ch 


OlGi a nee 


8.2.2. Theorem of limplicit Functions, Curvilinear Co-Ordinates 


Theorem 1. Jf the functions f;(x), 7=1, ..., n, are twice continuously differentiable 
ofp GOD fn) (x0 
DOGS eee 
neighbourhood W of «9 such that f(x) =(f,(x), +++; fn(x)) is a@ one-to-one mapping 
of W onto a neighbourhood V of y® = f(x) (Fig. 8.9). The inverse mapping «=9(y) = 
= (g,(y); «1 In(y)) ts twice continuously differentiable in V, and for ke 1,..., 2 ond 
j=1,..., n the following relation holds: 


OY; (Bien) Gio on) 


Remark 1. For n=1 one has the situation described at the beginning of Subsce. 8.2.1.; see 
also Subsec. 3.1.4. 

Remark 2. According to the considerations made in Subsec. 8.2.1., the theorem is plausible. 
Its proof, however, which is based on Banach’s fixed point theorem 2.5.3, is rather comphi- 
cated. 


in a neighbourhood U of ®€ Ry, and af ) +0, then there exists a 


Theorem 2. The assumptions shall be the same as in Theorem 1. Let the functions 
fi(x) be m times continuously differentiable in U. Then the functions gjly), with 
j=1,...,n, are also m times continuously differentiable in V. 


F(x) 
u ep d 
gly) 
(x) (y) Fig. 8.9 Fig. 8.10 


Remark3. If the above assumptions are fulfilled, then for a suitable choice of c; one obtains 
surfaces in IV by 


Pjlay, +) Sn) Sez, JH, 0 


(Fig. 8.10). For that purpose it is only necessary to choose (¢, ..., ¢,) € V. Then it follows from 
Theorem 1 that every point #¢ W can be identified in a one-to-one manner by indicating the 
parameter values ¢,, ..., Cn. Cy, «+, €, are called curvilinear co-ordinates. 


) eas 4 
§.2.4. 8.2. Implicit Functions 63 


ay : ee : 
8.2.3. Theorem of Parameter-Depending Implicit Funetions 


We consider functions of the form f;(x, AVE iy 25 ng gy cry Ar), which de- 
pend on x= (xy, ..., 2») as well as on the parameters 4 = Apex A Vel e) — lees 
Here / varies within a ball, AK ={4 | |A—20| <7 R,. 


Theorem. (a) Lf, for a fixed 2€K, the assumptions of Theorem 8.2.2/1 are satisfied 


s ; 
amd if fia, a) Cue r, 2) are continuous tn an (n +7)-dimensional neighbourhood 


of (2°, A), then ae inverse functions gx(y, A) of Theorem 8.2.2/1 are also continuous 
mn an (n+r)-dimenstonal neighbourhood (y, 2°), where y° = f(x, 2°). 

(b) If, moreover, the functions filx, A) are m times continuously differentiable 
with respect to Ay, ..., Ay in a neighbourhood of 4°, then the inverse functions gxy, A) 
also have partial omaatnes up to the order m with respect to Ay, ..., Ap, in a netgh- 


Ae OFk 
bourhood of 2°, Here the “ are calculated from 
"Q 
of Of; Og 
2, x Bx Ole = OM or 0  ee S—1 ca (1) 
Of» «++ fa) 


Remark. As the Ves determinant ————— 
BF iin, osc Bra) 


(vo) is different from zero, (1) can be 


resolved uniquely for — 


aE 


$.2.4.. Implicit Functions 


The question is whether F(x, ..., e,)=0 can be resolved for one of the vari- 
ables, e.g., x4 f(®2, Ree, vere = ices , Ln), and correspondingly £9 = (x9, ..., x2) 
for 2° = (x7, ..., £4)- The surface in R, ayhich is described by F(x) =0 shall be 
denoted by F. 


Theorem 1. (a) Let F(x) be twice continuously differentiable in an n-dimensional 
oF 

neighbourhood U of x°€ Ry. Let F(x°)=0 and ae (x9) + 0.Then F can be represent- 
t 


ed by x, =f(&) in an (n—1)-dimensional neighbourhood V of £°. f(®) 7s continuous tr 
a neighbourhood of %° (Fig. cally 

(b) If, moreover, F(x) is m times continuously differentiable in U, then f(¥) ts 
Uikewise m times continuously differentiable in V. 


Remark 1. The theorem follows from the implicit function theorems of Subsecs. 8.2.2. and 
Se Lodlo 


Fig. 8.11 
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SE eee ee 


on OF 
Remark 2. If (grad F)(x°) -(— oe ee a 


plied, where one possibly has to substitute another co-ordinate for 24. 


fo) +0, then the above theorem can be ap- 


Generalization: Let us now consider surfaces F,, ..., #; in Ry, which are given by 
F(x) =0 for ae 1, ..., 2. Suppose that l=/<n. Marcher letee ae, a On 
(Banas TE anal correspondingly 0 2 an aye.) forge = (ay, 2477.) Figo .12), 


Say 


3 Fig. 8.12 
( 
| 
Ret ee 
Xo 
Theorem 2. (a) Lf the functions F(x), ..., Fi(x) are twice continuously differentiable 
in an n-dimensional Tee ibouniood U a GOMIe Hing Cae) ihe JOA CIO EEN) fave epee Ni oon 
ae GE ij noes, 2) 


(x°) +0, then A F; can be represented as x; =f;(£), with j= 1,..., 1, 
O(x4, +++) 1) j=t 


in an (n—1)-dimensional neighbourhood V of £°. Here the f;(Z) are continuous in V. 
(b) Lf, moreover, the functions F(x), ..., Fi(x) are m times continuously differen- 


tiable in U, then the functions f,(%), ..., fi(¥) are Wkewise m times continuously 
es in V. 


and 8.2. 3. 


8.3. Extreme Values of Functions 
8.3.1. The One-Dimensional Case 


If y= f(x) is three times continuously differentiable on the interval [a, b], with 
—c~<a~<bh<co (Hig. 8.13), then according to Theorem 5.4.4/1 one has 


flee) = fla) + f'(20) (a — 20) 44) (@— 2924.0 (le — 203). (1) 


2 


Fig. 8.13 


If x is a relative maximum or mininium of f(x), then f’(2°) must be zero, because 
otherwise f(x) would be strictly monotone in a neighbourhood of x9. Conversely, 
if ((x°) =0 and f’(x°)=0, then Eq. (1) shows that f(z) has a relative minimum at 
aw. If f’(x°)=0 and f’’(x°) <0, then a relative maximum exists at x°. If f/(r°) = 
= f(x) =0, then a relative extremum need not necessarily exist at the point 2°, 
as is shown by the example f(x) =23 near the origin. 
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8.3.2. The n-Dimensional Case 


Definition 1. Let f(x) = f(xy, ..., an) be continuous in a neighbourhood of the point 
OE Don. ; 

(a) f(x) ts a relative maximum cf f(x) = f(x) for all x in a suitable neighbourhood 
of x. 

(b) f(x) ts a relative minimum tf f(x) = f(x) for all x in a switable neighbourhood 
of x9. 


n 
Definition 2. Let aj, be real numbers. The quadratic form >) 4;,x5;5, is called posi- 
on 
. He 
tive definite if there exists a positive number c such that >, aj45j5_2c |6|2 for all 


ECR,; it ts called negative definite if there exists a positive number c such that 

n 

D> aesisns —e él? for all €€ Ry, where &=(&, ..., &n). 
q;k=1 

gt a 
Remark. >) ¢j§; with c;>0 is positive definite. 
jai 

Theorem. Let f(x) be three times continuously differentiable in a neighbourhood of 
rau € I hipne af 
(a) If f(x°) ts @ relative extremum (minimum or maximum), then ae (x°) =0 for 
i=l hans Ws J 


2 
(b) Let a G0 jer j= 1, aa S cas (a) €;6, 7s positive definite, 
OX; pha aL OxjOXk 
n 

then f(x°) 1s a relative minimum. If 2 A ae 2 ) 58, ws negative definite, 
then f(x®) is a relative maximum. jb Onj0x 
Y. Integral Calculus in R,, 
o.1. Definitions and Properties 


9.1.1. O-Domains and J-Domains 


Let Q= {a | x€ Ry, |e— x| <a; for j=1,..., 2} be an n-dimensional interval in 
Ry (Fig. 9.1), and let |Q| = 2a, . dy be its saolkniines Here the a; are positive num- 
bers. A Q-domain 2 is a finite union of 2-dimensional intervals (Fig. 9.2), 


Une (U a). (1) 
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ok a2) 
rice Fig. 9.1 Fig. 9.2 


Here Q; is the closure of the n-dimensional open interval Q; (i.e. the n-dimensional 
interval including its boundary surface). Further let dw be the boundary of a set 
win Ry, (Fig. 9.3): y€éew if and only if every neighbourhood of y contains points 
which belong to , but also points which do not belong to m. We shall use the 
term “domain” whenever the set concerned is open. In this sense, 2 of Eq. (1) is 
a domain. 


Definition. A bounded domain Qin RK, is called an I-domain vf for every positive 
number e there exists a finite covering of 022 by n-dimensional intervals Q; , 


N;, avg 
Ole > OF sichthae 92 10 1 Nig, 4), 
i r=l 
Qe 
aw Fig. 9.3 OQ Fig. 9.4 
* 
Lemma. Let Q be a bounded domain in Ry, with 6Q= 0) Fj, where F; ts an tntervor 
=1 
patch of a surface G;in Ry which is represented by G; ee a 9.5 and Fig. 9.6). 
If Gj(x) has continuous partial derivatives of fir ) {+0 for 
xEG;, then Q is an I-domain. 
@ 
G/s Bop 
Fig. 9.5 fi Fig. 9.6 


Remark. /-domains are a relatively general concept. They include bounded domains with 
smooth boundaries (also with holes), but also certain domains with edges and corners. 


9.1.2. Integrals over O-Domains 


Let Q be a Q-domain, then we consider arbitrary finite decompositions Z into 


cee 
n-dimensional intervals: Q2= U Q; (Fig. 9.7). 
jet 


position Z Fig. 9.7 
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Definition. Let f(x) be a real-valued bounded function with the domain of definition 
D(f)= 2, where Q is u Q-domain. 


N 
(a) da= sip >>, @aulere Fa PA jl inf f(y) , 
ae 4 = ¥CQ; 
Me 
a f(x ee 2 Lu, witee => = 2, lQ;| sup f(y) . 
j=l veg a 


(b) f(x) zs sacd to be integrable nee) dr=f Hee ales: 


Remark 1. This is the generalization of Def. 3.2.1. As in that definition, sup (or inf) denotes 
Z Z 

the supremum (or infimum) taken over all permissible decompositions of the kind indicated 

above, where V is variable. 


Remark 2. The statements made in Subsec. 3.2.1. can be transferred immediately: Lemma 
3.2.1/1 remains valid. With a suitable modification, Lemma 3.2.1/2 is also applicable. 


Remark 3. Ii / is integrable, we write f f(x) dz or f f(x) dx instead of f /(x) dx or S (Hx) dx. 
5 eS 


9.1.3. Properties 


The theorems of this subsection are analogues to corresponding propositions 
made in- Subsecs. 3.2.2. and 3.2.3. 


Theorem 1. Let f(x) and g(x) be functions integrable over the Q-domain Q. 
(a) Let 2 and wu be real numbers, then Af(x)+ ug(x) vs integrable over 2, and 


J (Ale) + ngla)) dr=2 f fla) dtu f gle) de. 


(b) [f(x)| 2s integrable, and | f f(x) dz|= 3 lf(x)| dx. 
2 
(ec) If, moreover, f(x) =g(zx), then f f(x) nee ge) Neda. 
9 


Definition. /f {f;(2)}7., and f(x) are real-valued functions defined on Q, then f;=>f on 
Q (uniform convergence) means that for every positive number e there exists a natural 
number jo(e) such that sup |f(r)—f;(x)l =e for all 7 with 7 = joe). 

EQ 


Theorem 2. Let the functions fj(x), j=1, 2, 3, ..., be integrable over the Q-domain 
Otand lel fo) on. 1 hen f(x) 7s gai over Q, and 


ie dain: fiilx )d 


jroo 


9.1.4. Integrable Functions 


Theorem. (a) Let 2 be a Q-domain, then every function that is continuous on 2 is 
integrable. 
(b) Let w be an I-domain, and let f(x) be continuous on @ (Fig. 9.8). [f 2 a a 
Q-domain und @ & 2, then 
fix) for eo 
i fA for fese oO 


is integrable over Q. 
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Fig. 9.8 


Remark. w and Q are open. From @ CQ it follows that w has a positive distance from 92. 
Further it is seen that f g(x) dx is independent of Q. 
2 


9.1.5. Integrals over J-Domains 


Definition. Let f(x) be a function continuous on @, where w is an I-domain. Let Q 
be a Q-domain with © <Q, then one sets 
for wr€o, 


: ; Re) 
fiz) dr=f g(a) dx, where gte)={f for 2€Q—. 


Remark. From Theorem 9.1.4 and Remark 9.1.4 it follows that the definition is meaningful 
and that f f(z) dz is independent of 2. 


a 


Theorem 1. 7 f(x), g(x) are assumed to be continuous, then Theorems 9.1.3/1 and 
9.1.3/2 remain valid for I-domains, too. 


Theorem 2. Let w, wo, and oy be three I-domains, and let © =@,U @y and @,M w,=0 
(Fig. 9.9), then, for every function f(x) that ts continuous on @, 


i f(x) da= f f(x) da+ f flv) dex. 


G(Y Lies, 9) 


9.1.6. Iteration Theorem for n-Dimensional Integrals 


For a domain w in Ry, let o=ol {x | x, =c} be the intérsection of w with the 
plane x,=c (Fig. 9.10). If w is an n-dimensional Q-domain, then w, is either 
empty or an (x — 1)-dimensional Q-domain. 


Theorem. Jf w is a Q-domain, and if f(x) is continuous on @, then 
b 
Sf f(x) dx =f ( (olcee. Xn) dan} dxn , (1) 
oO a On 


PO ene te — (tira ty) Od a — eee ee ola), 


fh 
Fig. 9.10 


Remark 1. According to the above considerations, the inner, (x —1)-dimensional integral 
over ,, is meaningful. It gives a function that is integrable over the interval [a, b]. 


Fig. 9.11 


9 : , 
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Remark 2. If @=@Q is an n-dimensional interval (Fig. 9.12) and if f(~) =g(a*) h(a,), then one 
obtains 


b 
(ie) ell He”) cla [Ge alee, = 
Q ey a 


Remark 3. If o=Q= {vx | a;<2;<6;} is an n-dimensional interval, then an iterative applica- 
tion of Eq. (1) gives 


bn f &n—-1 a 
ieee —1) ( ff see ( Lf Bele ccee Baila Ba) aus) is tent} Gkey » 
il 


@ Gp \@n—1 


Remark 4. The theorem is not fully satisfactory; in particular the restriction to Q-domains 
is inconvenient. Later on we shall generalize this theorem substantially (Fubini’s theorem). 


oe. Transformation Formulas, Volume and Area Measurement 


Oa, Volume Measurement 


Definition. Let w be an I-domain in Ry, then lo] =f Idx is the volume of w. 
wo 
Remark 1. This definition is compatible with the statement made in Subsec. 9.1.1., saying 


that the n-dimensional interval Q= {a | |x; —arf | <aj} has the volume |Q|=2"a,...a,. To 
prove this, we consider the more general case of a parallelepipedon. Let ol Gees Gis 


j=1, ..., n be linearly independent vectors in kp, then they span a parallelepipedon with the 


n 
2 (comers) > eat (Fig. 9.13). Here e,=0 or g;=1.A parallelepipedon is an J-domain; its vol- 


j=l 
ume is denoted by |(a}, ..., a”)|. 


pane 
a? 
a’ 
0 Fig. 9.13 


(Gee) detaUr... a”)|, where 


Lemma. 
Ged), 
det (a!,..., a™)=| : : 
Cinco, 
Remark 2. It is clear that n-dimensional intervals are special parallelepipedons and that the 
volume formula indicated above is valid for n-dimensional intervals. 


n 
Remark 3. Let yj = D> djzxyz be a rotationin k,, then from the lemma as wellas from clemen- 


tary calculating rules for determinants it follows that the volume of a parallelepipedon is 
rotation-invariant. This implies that the volume of an /-domain is rotation-invariant, too. 
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9.2.2. Transformation Formulas 


Lemma. Let y=f(x)=(f (a), ..., falv)) with y=f(x), and let f(x) be twice con- 
tinuously differentiable in a neighbourhood of x°€ Ry. Letz= gy) = (GG) ony) 
with 2=qg(y), and let g)(y) be twice continuously differentiable in a Tae’ 
of y®€ Ry. If z=g(f(x)) = h(x) = (h(a), «.., An(x)), then 2 =h(x°), where the hj(x) are 
twice continuously differentiable in a seiqthemyerd of x9, and (Fig. 9.14) 


Cle ty 2-2 a) oe Ol 245 e-5 2h) ,) AY peas Un) (10) 
OX, nee Ly) AY; one) Yn) O(%), ae! Ln) 


(y) 


(2) Fig. 9.14 


a sees Yn 
I f,mor See) (x9) 
AG, an) 


relation holds: 


+0, then for the inverse mapping x= x(y) the following 


Ys, +++ ae » Yn) ‘ OL) tery » Xn) (y)=1. 


AC an) OY)» nee + Yn) 
OY O08) Yn) : : 
Remark 1. If A ae (x) +0, then it follows from Theorem 8.2.2./1 that y(a) is a 
Lo cree Uy 


one-to-one mapping of a neighbourhood w of x® onto a neighbourhood 2 of y®. In particu- 
lar, there exists an Inverse mapping. 


Theorem. Let y=y(x) be a one-to-one mapping of the I-domain wc Ry, onto the 
a? Yio sor , Yn) 


o(x Lys teey , Xn) 
x= xy) is the vnverse fede cire of y(x), then g(y) = f(x(y)) ts continuous on Q, and 


I-domain 2c Ry, with ——— (x) +0 for x€@. If f(a) vs continuous on @ and if 


EAE | dy. 


I ie) eel, gly) | eine ase eh | 


w 


Remark 2, Note that it is the absolute value of the Jacobian determinant which appears in 
the last formula. 


9.2.3. Are Length of Curves 


Differentiable curves x(¢) = (p,(d), .--, gn(¢)) in Ry have been dealt with in Sub- 
sec. 8.1.5. Here we assume that the tangent v(t) =(—;(¢), ..., v,(t)) +0, where the 
functions g;(t) are assumed to be continuously differentiable on [a, b], with 
—eo-=<q<bh<oo. Further let a=t)<t)<...<44,=6. The points z(t) and x(t;+1), 
poleve reed sale Coro eee by line segments (Fig. 9.15). This gives a poly- 


gon P of length Lp= > la(t;+41) —x(d;)|. 


j=0 


go ‘ A y 
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ee ee ee 8 eee eee 


i=a Fig. 9.15 


Definition. L=sup Lp ts the length of the curve, where the supremum has to be 
taken over all polygons of the kind described above. 


. fn 
Theorem. L= | | > vi (t) dé. 


a aS 


9.24. Area Measurement 


We want to measure the area of surfaces F which are represented in Ry11 by 
F (xy, ...5 Xn +1) =0 (Fig. 9.16). According to Subsec. 8.2.4. we may assume without 
lossOt generat y that 2 (a, etn 1) =f 2)— oy, Where 7 =(x,, =, ty). Hence 
F is represented by xn 41=f(2), r€mC Ry. Let w be an n-dimensional /-domain. 
The method is analogous to that described in Subsec. 9.2.3. For n=2 we perform 
a triangulation: points are marked on F and interconnected by triangles. Thus 
the surface is approximated by a web whose segments are triangles. For n +2 the 
web consists of patches 4; of n-dimensional planes. As these patches are n-di- 
mensional J-domains, the volumes |4;| can be determined according to Subsec. 
Salle 


Fig. 9.16 


Definition |F|=sup >) |4;| vs the area of the surface F, where the supremum has 
BF 
to be taken over all permissible decompositions of the kind indicated above. 


Remark 1. For n=2 the definition is satisfactory, because the triangulation is easy to 
visualize. For n>2, imagination will fail, and a more preeise formulation of the above defi- 
nition will be required. We shall not, however, go into greater detail here. 


Theorem. Jf f(x) has continuous partial derivatives of first order ina neighbourhood 
of @, then 


Pale f lgrad F| dr= i Yu > 


van (1) 


of 

j=t | On; 
" af a 

Remark 2. One has grad F={—,..., —_, 
Oa, Oi. 

ds=jgrad F| dz (2) 


. 


= i) +0. The expression 


is usually called surface element on F’. ds as well as the formula in the above theorem can be 
given anintuitively geometrie interpretation by referring to “infinitesimal” surface ele- 


Fee 
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pcre Re aa a is a 


x 
ments (instead of the above plane segments). From Fig. 9.17 It is observed that a5 


lyon 4at 6 
gis where »y=grad F is the normal vector and ,,,; 1s the (n+ 1)-th component of ». 


P| - . . es . eye 
Substituting now v and |v, ,,|=1, one obtains Eq. (2). which also gives an Intuitiv ely geo- 


metric interpretation of (1). 


v IF 
as z 
I Gas 
i; aw! 
i ax 
ax Fig. 9.17 


9.2.5. Surface Integrals 


We consider the same surface F as in Subsec. 9.2.4. The general point on F' is 
denoted by s. Every s€F is assigned a unique r€@ by s =s(x). If a function g(s) 
is defined on F, then we obtain a corresponding function G(x) =g(s(x)) defined on 
@. If G(x) is continuous on @, g(s) is said to be continuous on F’. 


Definition. Let g(s) be a real-valued function that is defined and continuous on F, 
then we set 


fs) e DS): G(x) |grad F| dx. (1) 


Remark 1. The geometric interpretation of this formula follows from Remark 9.2.4/2. In 
particular, the Icft-hand side of Eq. (1) is independent of the specific choice of the co-ordi- 
nates x, ...,%,. It is also possible to define the left-hand side of Eq. (1) directly on F by 
extending Def. 9.1.2 to F. Note that, for Eq. (1) to be valid, F must have the form described 
at the beginning of Subset. 9.2.4. 


Remark2. Let / be an arbitrary surface F(a, ..., @,41)=O0in Ry 44, with (grad F)(x) +0 on F. 
Then / can be decomposed into partial surfaces F',, F's, ..., where each of the partial surfaces 
has the standard shape indicated in Subsec. 9.2.4. (where possibly z,,, must be replaced 
by 7, ..-,%,3 ef. Subsec. 8.2.4.) (Fig.9.18). Now it is possible to calculate f g(s) ds by adding 
together the integrals f g(s) ds. ia 

oi 


ss 

. <3 
A 4, (\?/ 

Fig. 9.18 y Wei, Oil 


Remark 3. A possible application of surface integrals is as follows. Let Q be a smooth, 
bounded domain in #,, being covered by smooth surfaces F,, with F(x) =t (Fig. 9.19). 
Here tis a parameter, t¢€(a, 6), and F(z) is continuously differentiable on Q, with |grad F(2)| 
+0 for €Q. If 

dx =o(s;, t) ds,dt, o(s,, t)>0, 


where ds, is the surface element on /,, then one obtains 


b 


J g{x)da= f ( J GSe #) ofS, t) ds,) dr. 
2 Cit, 


DEG. 
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Here 9(x) is continuous on Q,s,€ F;, and §(s;, t) = g(x) if # corresponds to the point (, t)€ F;. 
The factor o(s;, #) must be determined in each particular case. We consider an important 
example. Let ds, be the surface element of an n-dimensional sphere of radius r=0 centered 
at the origin (Fig. 9.20). Then dx =ds,dr (ie., in this case 9 =1 because, roughly speaking, 
ds, and dr are orthogonal). If ds=ds, is the surface element of the unit sphere, then ds, = 
r™—lde, and hence dz =r?-!dsdr. ; 


AN 


Fig. 9.20 


9.2.6. The Unit Ball, P (7) 

|1’,| shall denote the volume of the unit ball V,={x| lal<1}in R,, and lo, 
shall denote the area of the surface wy = {x | |x| =1} of Vy. Well-known values 
are |w| = 27 (definition of =), |J’,|=7x as well as |wsl =47 and [V,| == T. 


Theorem. 


(a) 


p=HBy, By Ty ove 


‘ (-fE) for n=2,4,6,... (O!=1), 


tel s 
_ 
~~” 
— 
wl es 
| 
bo 
~~ 
to| 
bol Re 
—< 
2) || 
~~ 
@) 

S 


2 » 
252 (mye 1 jor w= 2,450)... 
ony ees [P/E 
a : - 2 JOT ie 3, OF Tacs 
2 T 
lon| 


Remark 1. The proof is based on a trick. One calculates the 2-dimensional improper Rie- 
mann integral f e—|%7l dz in two ways: 


Ry 
© gs n es 
a er - 
| e-? dit] = i e—l2? da= i | e—ryn—1 ae a 16 (5) ; 
oo Ry 0 Op - 


where we have used the formula stated at the end of Subsec. 9.2.5. From this one obtains 
all the propositions of the theorem relatively easily; ef. [66], Appendix 2. 


Remark 2. 


w,| attains its maximum for n=7. Further, |w,| +0 as n>. 


4 
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9.2.7. Improper Integrals 


By analogy with Subsec. 7.2.1., where improper one-dimensional integrals 
have been considered, it is also possible to study improper n-dimensional Rie- 
mann integrals. Again we shall restrict ourselves to two types. 

(1) Let w be an /-domain in F,, then o,= a — {x | |a —x°|<e} is also an J-do- 
main, provided that 7®°€w and e=0 is sufficiently small (Fig. 9.21). If f(x) is 
continuous on ®— {x9}, one may ask whether lim f f(x) dx exists. If this is the 


&{0 w, 


case, the limit is denoted by f f(x) dz, and called improper (Riemann) integral. 


wm 
he 


(7) 
Fig. 9.21 


(2) Oy = {x | 1<|z|<N} is an J-domain. If f(x) is continuous for [zl 21, then, 
by analogy with (1), one can investigate the limit 


Lima /a;) die = a, f(x) dx 


No Qu 
Lemma. (a) f |a|* dx exists if and only if «> —n. 
{|< 
(b) ff lal* dx exists if and only ifa<—n. 
|a|> I 


Remark. This is the analogue to Example 7.2.1/2. In the case (a), x? =0 in the sense indicated 
above. 


9.3. Integral Theorems 


9.3.1. Gauss’ Theorem 


Domains which are columnar in the direction of x, are bounded by a finit 
number of smooth surfaces, where the boundary surfaces have the properties of 
the surfaces F; of Lemma 9.1.1 netic hs the AF euem a Tay properties of 
G;(x)) (Fig. 9. 22), Let 20 = (@ie tye.) ander ae “Ani, & 2,) be two bound- 
ary points that differ only in their r,-co-ordinates, x, )> ai, then the line seg- 
Met 2 (yy eo a le ri <%y_<.x)} is assumed to be included in the do- 
main. Columnar domains are bounded. Riemann standard domains are domains 
composed of finitely many domains which are columnar with respect to all of 
the directions 7, ..., % (Fig. 9.23). Let »=(», ..., v) be the normed outer normal, 


hence |y|=1 (cr Subsec. 8.1.5.: if the surface is given by F(x)=0, then y= 


grad F : : : : 
=e Ea) . This normal exists at every boundary point, except for possi-- 


columnar Riemann 
domains Fig. 9.22 standard domain Fig. 9.23 
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ble ridge points, which are of no importance. Let «= (a), ...,¢) and b= Onur 
1 

be two vectors in R,, then (a, b)= >) ajb; is their scalar product. For |a/=1 and 

lb] =1 one obtains: pe 


{ 
n n Byrn 
I(a, b= > lajl lols ( 5 i) be vj] =i. 
; j=l jet = 
Hence we can set (a, b) =cos a with O=e=n. ais called the angle included by the 
normed vectors a and 6. We do not risk confusion when writing cos (a, 6) instead 
of cos a, where it is understood that (a, b) in cos (a, b) does not mean the scalar 
product (a, b). Let e;= (0,..., 0, 1, 0, ..., 0) be the unit vector in the direction of 2;, 
then »;=(», e;)=cos (v, xj), the latter being a common (though not quite correct) 
notation. Hence, 
P= (Cos @ 2;), -.., cos (?, x,)) « 


Theorem 1 (Gauss integral formula). Let 2 be a Riemann standard domain, and 
let f(x) be continuously differentiable on Q, then, for 7=1, ..., n, 
° C a 
| eh Co eclia | }(s) cos (vg, xj) ds . 
« Ox; . 
dQ 
Here ds is the surface element of 02 and vs ts the normed outer normal at the point 
SEOQ. 


Definition. If the functions f,(x), j=1, .... n, are differentiable at the point x°¢ Ry, 
then one denotes : 
Soi 
(liv /)(2°)—= (29), eoleere fH (fs vee fn) 
jai OX; 
Theorem 2. Jf 2 is a Riemann standard domain, and tf the functions f(x), j= 
=1,..., n, are continuously differentiable on Q, then 


J (div f) (x) dx=f (f, ») ds, 
dQ 


is) 
n 

where f=(f1, -»s fn) and (f, vy) = > f(s) cos (vs, 2;). Here ds is the surface element of 
q=1 


02 und vz is the normed outer normal at the point s€eQ. 


ORece Green’s Formulas 


Definition. Let f(x) be twice differentiable at the point x9 € In, then one sets (Af)(c°) = 


nm 92 
— os (2°) (Laplacian differential operator). 
a 


(me, 
Theorem 1 (First Green formula). [f Q isa Riemann standard domain, and if f(x) 
is twice and g(x) ts once continuously differentiable on Q, then 


° > . Co 
{ eheaN VAS in) Bled i (grad f, grad g) dx + | g a ds . 
‘ 2 ae 


of . Wieck er 
Here v=v, is the normed outer normal to 02 and 2 is the directional derivative of 
yp 
Subsec. 8.1.7. 
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EE 
Remark 1. Again ds is the surface element of 0Q, and 

(grad /, grad g) = i on i, 


Theorem 2 (Second Green formula). If 2 is a Riemann standard domain, and if 
f(x) and q(x) are twice continuously differentiable on Q, then 


fo 2) (Af) () — fla) (4g) (2) de= flea 2) as 


dQ 


Ca Peat a ae 
Here v=, ts the normed outer normal to 02, and = ts the directional derivative of 
Subsec. 8.1.7. G 


Remark 2. The differential and integral calculus in R, developed in Chapters 8. and 9. 
frequently appeals to intuitive power. This is especially true for the integral theorems 
of Sec. 9.3. For a modern, rigorous but yet largely elementary introduction into this subject, 
the reader is referred to [60]. 


10. Ordinary Differential Equations (Methods of Solution) 


10.1. Separable, Homogeneous, and Exact Differential Equations 


10.1.1. Formulation of the Problem 


In Subsees. 4.1.2. and 4.2.3., we had formulated the typical initial value prob- 
lem for ordinary differential equations of order n: desired are real functions 
f(x) that are defined and » times continuously differentiable in a neighbourhood 
of 2», solve the differential equation f(x) =h(x, f(x), ..., {@-(x)), and assume 
the initial values /(x 9) =c;, where 7=0, ..., »— ik A corresponding problem has 
to be solved for systems of differential equations. In Chapter 4, the existence and 
uniqueness of the solutions of such problems has been established, provided that 
h exhibits certain smoothness properties. In Chapter 5 elementary functions 
have been defined by means of ordinary differential equations. Now we choose 
the reverse approach: desired are explicit solutions for special classes of ordinary 
differential equations. Here we utilize the theory of elementary functions and the 
one-dimensional integral calculus. 


10.1.2. Separable Differential Equations 
Theorem. Lei —o<a<b<-= and —o<e<d<o (Fig. 10.1.). Further let f,(é) and 


fo(t) be continuously differentiable on (a, bland [c,d], respectively, and let fo(t) +0 
for t€[e, d]. Lf x)€ (a, b) and yo€ (c, d), then the unique solution of the initial value 


HO HB 10.1, Separable, Homogeneous, Exact Diff. Equ. aa 


problem, 
¥=fil2) Aly), Y%) =o » (1) 

can be calculated from 
du 
fo(u) 


Remark 1. Theorem 4.2.3 implies the existence and uniqueness of the solution (at least 


— | fe) dv=0. (2) 


X 


: : oF 1 
loeally in a neighbourhood of {xo. yo}). As =— (Xo, Yo) = 77—~ #0, Theorem 8.2.4/1 shows 
oy f2(Yo) 


that F(a, y)=0 can be represented as y=y(x) in a neighbourhood of x. One has y)=y(2y). 
This is the desired solution of Eq. (1). Whether the integrals in (2) can be ealeulated expli- 
citly, and whether the resulting equation can be resolved explicitly for y=y(x) by means 
of elementary functions, depends on f, and fs. 


Remark 2. Formally, one obtains (2) as follows. 


d 
ree fo(x) is written in the form Ty ahh) da, 


and the last expression is integrated: 


ie 
Ie /\(e)dvu+e, c real. (3) 


Here the lower Iie of integration in (3) is omitted. f 


1 
ae and fa /,\(v) dv are arbitrary 
ge 


primitives of —— and /,(v), respectively, in the sense of Subsee. 3.2.5. Then the arbi- 


Taw) 


trary parameter c is given a value that makes the eurve pass through {2, Yo}- 


Remark 3 (Examples). 1. Find solutions of y’= 2 One obtams 


dy dz 
=———,, Ingy—natine and y—cz . 
yo 


These are straight lines passing through the origin (Fig. 10.2). Tf it is required that the line 
passes through a given point {2 9, y)}, then ¢ must be determined accordingly. 


x . 
2. Find solutions of y’= ——. One obtains 
Y 
9 ¢ 9 9 
ee and #-+y2=c. 
2 
oa ae 
fi 
/ *,¥,} 
t 
\ 
\ 
/ 
j b / 


i oS Fig. 10.2 oe Fig. 10.3 
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These are concentric circles centered at the origin (Fig. 10.3). These two examples also 
show that it is not always desirable to resolve F(x, y) =0 of Kq. (2) for y. 


x 
Remark 4 (Special cases). 1. If y’=f(«) =/f,(x) (ie., if fo(y) =1), then y=ygt+ f f(v) dv. The 


Xo 
integral curves are shifted in parallel with the y-axis (Fig. 10.4). 2. If y’=/(y) =foly) (Le. if 
u 
f(z) =1), then «—ay= | sa . The integral curves are shifted in parallel with the a-axis 
(Fig. 10.5). J f(u) 


Yo 


Fig. 10.4 oe | te Fig. 10.5 


10.1.3. Homogeneous Differential Equations 


Theorem. The setup y(x)=-xz(x) reduces the homogeneous differential equation 
: ; : —z 
y=] (2) to the separable differential equation z= gle : 
xv 


1 
Remark. In the sense of Eq. (10.1.2/2), one has f,(x) = and hence f Ac dv=lIna. Natu- 


rally, f must have the necessary differentiability properties in order that Theorem 10.1.2 be 
applicable. 


10.1.4. Exaet Differential Equations 


Theorem. Let —o<a<b<<« and —o<c<d<oo (Fig. 10.6). Let the functions 
P(x, y) and Q(x, y) be continuously differentiable on the rectangle R = (a, b) x (ce, d), 
and let 


OE oq 
3 (DA Fe UN) for WER. (1) 
Further let Q(x9, Yo) +0 for a fired point {x 9, yo} CR. If 
x u 
M(x, y)= f Plu, yo) dut+ fQ(x, v) dv for (x, yJEk, (2) 
vo Yo 


then the unique solution of 
P(x, y) + Q(x, y)y’=0, (x0) =Yo» (3) 


can be determined from M(x, y)=0 in a neighbourhood of {x9, yo}. 


C 
GaneX x 6b Fig. 10.6 Xoo *Yo Fig. 10.7 
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Remark 1. In Eq. (2) we have assumed that r=ay and y=yy. If «<a, one has to interpret 
v to 
(as usual) f as — f . The y-integral is interpreted analogously. From Subsec. 4.2.3. it follows 


2 x 
that Eq. (3) has a unique solution in a neighbourhood of {xg, yo}. Further, with the use of 
Eq. (1) one obtains 
oM Pp ou ; 
gp aE Ys ee ev) — On, y)- (4) 
‘ OM 
In particular one has py y) +0 in a neighbourhood of {ao, yp}. Theorem 8.2.4/1 then 


shows that M(x, y)=0 can locally be represented as y=y(x), where yy=y(Xp). This is the 
solution of Eq. (3). 


Remark 2. The path of integration in (2) is indicated by the solid line in Fig. 10.7. Choosing 
the broken line, one obtains : 


ES y x 
M(x, y)= f Oxo, v) dvu+ f Plu, y) du. 
Yo x 


i} 
With the use of Eq. (1) this gives A(x, y) = M(a, y). In other words: the two paths are 
completely equivalent, as it should also be expected. 


Remark 3. Differential equations of the type (3), (1) are called exact differential equations. 
According to (4) they have the form 
0M oM 


ie Oho = (5) 


Conversely, if for two continuously differentiable functions P(x, y) and Q(z, y) one wants 
to find a function M(a, y) that is twice continuously differentiable and satisfies (4), then 
it follows from Theorem 8.1.2 that 


oP PM aQ 


Hence (1) is necessary and sufficient for the differential equation im (3) to be written in the 
form (5). 
Remark 4. If 4/(x, y) is continuously differentiable, then 
aM oM 
M =— + 
d On da + om 


is called the total differential of M/. If M(z, y) is the function of (2), then according to (4) 
one has 


dy (6) 


dM=P da+@Q dy. (7) 
Thus the approach to the solution of exact differential equations can be described as follows. 
Desired are functions M(x, y) which show that P dz+@ dy is a total differential in the 
sense of (6), (7). Then functions y(x) for which M(a, y(x)) =c are solutions of P+Qy’=0. 


Remark 5. The differential equation «+yy’=0 referred to in Remark 10.1.2/3 is of the 


aP dQ . 
exact type. One has P=a and Q=y, and hence Fy ea According to (4), 


1 ; 
M(x, y=5 (2+ y2). This gives the solutions. «7 +y? =c 


10.1.5. Integrating Factor 


In Subsec. 10.1.4. we dealt with differential equations of the type P+Qy’ =0 
by representing them in the form (10.1.4/5). The necessary and sufficient condi- 
tion for this approach was that (10.1.4/1) was truc. If this condition is not 
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fulfilled, it is possible to look for an exact differential equation which has the 
same solution y(x) as P+Qy’ = 0. If, for example, p(x, y)>0, then every solution 
of P+Qy’ =0 is also a solution of 

wea, y) Pla, y) + ur y) Qa y) y= 0 (1) 
and vice versa. (1) becomes an exact differential equation if 


a oO Ou Ou OR 00) 
Lt p= a) ee Pacem 2 
0-5 HP) = (= Ge PE u( GE (2) 


For given P and Q this is a partial differential equation for u(x, y). In general, 
partial differential equations are much more complicated than ordinary ones. A 
systematic theory of solution for (2) cannot be stated. But in special cases it is 
possible to guess solutions u(x, y). Then u(x, -y) is called integrating factor. It 
reduces the original problem to an exact differential equation. 


10.2. Linear Differential Equations of First Order 
10.2.1. The Equation y’=f(x)y 


Theorem. Let —c<a<b<o and xy€[a, b]. Lf f(x) ts continuous on [a, b], then 


x 
f f@)du 
y(x) = ype » @BeEb, (1) 


is the unique solution of the problem 
y=fr) ys y(%o) = Yo - 
fs 


& 
Remark 1. As usual, one has to interpret fas — f ‘if B2BMyy. 
ry x 

Remark 2. The conditions of problem 4.1.2/1 are satisfied. This, according to Theorem 4.2.2, 
ensures the existence and uniqueness of the solution. y’=/(x)y is a special separable differ- 
‘ential equation as described in Subsec. 10.1.2. In this case, however, it is easy to find an 
explicit solution of F(a, y)=0 of Eq. (10.1.2/2) (where Theorem 8.2.4/1 with its additional 
differentiability assumptions is not required). 


10.2.2. The Inhomogencous Linear Differential Equation 


Theorem. Let —o<a<b<< and xg€[a, b]. If f(x) and g(x) are continuous on 
[a, b|, then 


av He oe 
f foddu f{f@du x —f fwdu 
y(x) = yor” + e% ce? g(z)dz, a=xz=b, (1) 


x 
is the unique solution of the problem 
y=tx) y+g(e), y(%)=Yo- (2) 


Remark 1. As regards the interpretation, existence and uniqueness, see the remarks of 
Subsec. 10.2.1. 


Remark 2. What is more interesting than the complicated formula for y(z) is its derivation by 
means of a method of variation of the constant. The general solution of the homogeneous 


HON oale 10.3. Linear Systems of Differential Equations 81 


equation y’=/(x) y is obtained from Eq. (10.2.1/1) by regarding yy=c as an arbitrary real 
parameter. To solve the differential equation in (2), one uses the setup 


a 
f fQodu 
u(x) =e(n) e% 


i.e., the eonstant ¢ in the solution of the homogeneous equation is varied, ¢ =¢(x). Substitu- 
tion of this setup into the equation y’ =f(x)y +9(z) gives the following differential equation 
for ¢(x): 
wv 
— f faodu 
(x) =9(e)e * 
The solution of this equation is (1). When the functions f(z) and g(x) are given explicitly, it 


is often preferable to follow this straightforward way of solution instead of using the given 
formula (1): this will simplify the caleulation. 


10.3. Linear Systems of Differential Equations of First Order 
10.3.1. Fundamental Systems and Wronski’s Determinant 
If f(x) are real-valued functions continuous on [a, b], then 
7 

yj (x) =) fir(z) yelz), y(t) =? 5 (1) 
where j= 1, ..., n, has a unique solution. Here -«~<a<b<oanda=r=b. Fur- 
ther, y{, ..., y are given real numbers and x)€[a, b] (Fig. 10.8). This follows 
from Subsec. 4.1.2. (problem 2) and Theorem 4.2.1. In contrast to the general 


case discussed there, (1) is linear with respect to y, ..., Yn (this explains the name 
“linear systems of differential equations”). 


Theorem 1. Let (y1,1(2), > Yin(2)) and (y2,(r), oo Y2,n(x)) be solutions of 


yil2) = B foe) yale, F= boos (2) 


If c; and cy are real numbers, then (crys a(x) + eoyr,t(Z), «05 C1Yt n(x) + CoY2,n(x)) ts 


also a solution of (2) (linear combination). 


Remark 1. What is meant is that (2) shall hold with y;, instead of Ya» OF with yy, instead 
yp (corresponding to Yy1j OF Y,;). The proof is simple. For abbreviation, from ro we 
shall write 9(a) =(y1(x), --» Yn()). The linear combination in the above theorem is thee 
written as ¢,¥1(%) +¢.%2(2), where it is understood that 7, and 7 are the two solutions. Now 
the following problem arises: can every solution of (2) be represented asa linear combination 


of special solutions? 
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Definition 1. The solutions yj(x) =(yj,1(%), «+ Yjn(x)) of (2), with F=H1, ..., m, are 
called fundamental system if, for all x€[a, 5], 
Yi CX) ++ Yt nlx) 
Wig: : +0 
Yn(X) +++ Ynn(X) 
(W(x) is called Wronski’s determinant). 


Remark 2. Hence for the 7; one has 
Yj ela -2 frul%) Yj,a(%) 2) 


WING P= Tle ones eee nonry 10 


Theorem 2. (a) There exist fundamental systems for (2). 

(b) If W(ao) +0 for a point xo€[a, b] with respect to the system yy, --.; Yn of solu- 
tions of (2), then W(x)+0 for all x withasa=sb. a 

(c) If Ula), --) Ya(z) 18 @ ie system of (2), then (x a cjy;(x) 18 the 
general solution of (2). Here cy, ..., €n are arbitrary ia ee i 


Remark 3. To prove (b), one shows that 


a)=(3 fila )) Wa (@) igre elu - 


If Wao) =0 for a point x €[a, b], then it follows from the uniqueness theorem that W(z)=0 
for a=z=b. This leads to the following alternative: either W(x) +0 for all pole 6] or 
W(x) =0 for all 7€[a, 6]. Now the proof of (a) is straightforward. The solutions 7%, ..., Jn, of 
(2), for instance, are a fundamental system if y; 4(%) =0;,4 for a point x €[a, 6]. 


Definition 2. The solutions; 9j(x) =(yj,1(&), «+» Yjn(a)) of (2) (or (2’)), where j= 


=1,...,m, are said to be linearly dependent if there exist real numbers C4, ..-; Cm; 
with g lej| +0, such that 
j=l mo 
> cyi(x)=0 forall xw€[a, 6]. (3) 
jai 


Tf such numbers Ey ony Cm do not exist, so that (3) holds for all x€l[a, b), then the 
SOLULIONS Yj, ++» Ym are said to be linearly independent. 

Theorem 3. The solutions ¥,(x), ..., Yn(x) of (2) (or (2’)) are linearly dependent if 
and only if W(x)=0 for all x€[a, 6]. 


Remark 4. Consequently, according to Remark 3, 7, .... Yn are linearly independent if and 
only if there exists a point a) €[a, 6] such that W(a9) +0. 


10.3.2. Inhomogeneous Systems of Differential Equations 


In Remark 10.2.2/2 we had described the method of variation of the constants 
for the solution of inhomogeneous differential equations. The method can be 
transferred to systems. Desired are solutions of the inhomogeneous linear sys- 
tem of CHEE equations of first order 


=D file) (z) +92), fal wn, (1) 
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7 S — 

Here fjx(x) and g;(x) are real functions that are continuous on [a, bj, with 

= SS As regards the existence and uniqueness of solutions, the state- 

ments made in Subsec. 10.3.1. are also true here (cf. Theorem 4.2.1). Let 4;= 

= (ni, 8 Yjn)> J=1,...,n, be a fundamental system for (10.3.1/2) in the sense 
of Def. 10.3.1/1, then (by analogy with Subsec. 10.2.2.) we make the setup 

n 
ylr)= 2, eae) y(e) . (2) 
If this setup is introduced in (1), one obtains the following differential equations 


for #he functions ¢,Cr), ..., ¢,(x): 
R 


4 er(x) yi s(x) =gi(x), fH1,...n and x€[a, 6). (3) 


The determinant of the (y;(x))f;-1 is W(x), and hence is different from zero for 
x€[a, 6]. Consequently, (3) can be resolved for ¢;(r), ..., c,(x), and intégrated. 
Substitution of the result into (2) gives a special solution %(2x) of (1). The general 
solution of (1) is then given by 

n 


Y (x) = Yo(2) ye byj(r), x€[a, b], 
j= 


where by, ..., by, are arbitrary real numbers. Hence, if a fundamental system for 
the homogeneous system of equations is known, it can be used to construct the 
general solution of the inhomogeneous system. 


10.3.3. Special Systems of Differential Equations 
Suppose that all the assumptions made in Subsecs. 10.3.1. and 10.3.2. are 


fulfilled. In addition, let /;;(7)=0 for k>j. Consequently, (10.3.2./1) has the 
form 


yi=hiays 7 
y2=frayit fo,2y2 ane, 
93 = fay + f3,2Y2 + fads a. 
ee ti oe. + fan + Yn + 


Here f;4=fj4(2), gj =gj(x) and y;=y;(x). Such triangular systems can be solved 
explicitly. The solution y,(x) is found according to Subsec. 10.2.2. Substituting 
y,(x) into the second row, one finds the solution y,(x) likewise according to Sub- 
sec. 10.2.2. Iteration shows that in this way all the y;(r) can be determined step 
by step. By this method it is possible to fulfil given initial conditions y;(79) =a;, 
fica Meenas 

Remark. In general it is not possible to find explicit solutions for arbitrary systems of the 
form (10.3.2/1). 


10.3.4. Systems of Differential Equations with Constant Coefficients 


From the considerations set out in Subsecs. 10.3.2. and 10.3.3., it follows that 
systems of the form (10.3.2/1) can be solved explicitly if they cau be reduced to 
triangular systems as described in Subsec. 10.3.3, This is not generally possible, 
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Be EEE 


except when the functions f; .(2) of (10.3.2/1) are constants. Further it follows from 
Subsec. 10.3.2. that we may restrict ourselves to the homogeneous case g;(x) = 0. 
Setting 


fit fin 
| ; : , 7.7) real mimmnbers, 
In, oes fan 
we can write the homogeneous system (10.3.2/1) with constant coefficients /;,4(2) = 
=e as 
i (e) = FUa) (1) 


7 n 
where, as previously, 7=(yy «> Yn) and rg=(3 Hine eo, ina) (matrix 
j=) jal 


multiplication). With the setup z= Ay, where A = (aj0)%,-f is a matrix with con- 
stant cocfficients, Eq. (1) becomes 


2’ (x)= AFA~'2 (x) . (2) 


Here we assume that the determinant of («;,4)%;.1 is different from zero. From 
the matrix calculus it is known that A can be so chosen that Af'A~! has a tri- 
angular form (with zeros above the main diagonal). This system (and hence also 
the original system of equations) can be solved in the manner described in Sub- 
sec. 10.3.3. The details, which shall not be discussed here, are rather complicated. 
A special case corresponding to a differential equation of order n shall be studied 
more closely later on. 


10.4. Linear Differential Equations of Higher Order 
10.4.1. Formulation of the Problem 


In fa, b], with —o<a<b<~, we consider the linear differential equation of 
order n, 


n—-1 ~ 
yer) = B file) Ya) +900), 2 


where fo(2), «+, fe—1(x) and g(x) are real-valued functions continuous on [a, 6]. In 
particular, the assumptions of Problem 1 of Subsec. 4.1.2. are fulfilled. Then, ac- 
cording to Theorem 4.2.2., there exists a unique solution of (1) for given initial 
conditions y)(a_)=c;, 7=0, ..., »—1, where x )€fa, 6] (Fig. 10.9). As has been 
stated in Remark 4.2.2, Eq. (1) reduces to a special system of first order. If 
2j(x) =y(a) for 7=0, 1, .... n—1, then 


a0 = al 


Be 
ll 

R 

i) 


Aj = , en—1 
Zn— =[o{x) 2g ++ +fn—1(%) en-14+9(2) - 
This is a special system of equations, but unfortunately it has not the form dealt 


with in Subsec. 10.3.3. The conclusions of 10.3.1., 10.3.2. and 10.3.4. can, how- 
ever, be transferred to this case, which shall be done in the next two subsections. 
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10.4.2. Fundamental Systems and Wronski’s Determinant 


The functions /;(x) have the same meaning as in Subsec. 10.4.1. 


Definition. The solutions y;(x), ..., Yn(x) of the homogeneous differential equation 


y(x)= 2 fitz) yo) (1) 


70 
are called a fundamental system if, for all x€[a, b] , 
Ys) Yi(z) oe YF PC) | 
ce eet : : +0 
Yn(X) Yn(r) Ys (x) 
(Wronski’s determinant). 


Remark 1. V(x) coincides with W(x) of Def. 10.3.1/1 with respect to the system of Subsec. 
10.4.1, 


Remark 2. As in Def. 10.3.1/2, solutions y;(z), ..., ¥m(a) of (1) are called linearly dependent 

if there exist real numbers ¢, ...,¢, such that 2 le;|>0 and Di cyy;(x) =0 for x€[a, bj. 

if such numbers do not exist, then y,(2), ..., Pee called ety independent. 

Theorem. (a) [f y,(x), ..., Ym(x) are solutions of (1), and tf cy, ..., Cm are real num- 
m 


bers, then 2 cjyj(x) 1s also a solution of (1). 


j 
(b) Lf yy(x), +) Ym(x) ts a fundamental system, then every solution y(x) of (1) can 
n 


be represented as y(x)= D, cjyj(x), with suitable real numbers cy, ..., Cn- 


c) There exist fundamental systems for (1). 

d) The solutions y,(x), ..-, yn(x) of (1) are linearly dependent if and only if 
x)=0 for x€[a, b]. 

e) Lf yy(x), «+5 Yn(x) are solutions of (1), then either W(x) +0 for all x€[a, 6] or 
W(x2)=0 for all x€[a, 5}. 


Remark 3. The above theorem follows from Subsec. 10.4.1. and the considerations set out 
in Sec.10.3. for systems of differential equations. 


( 
( 
W( 
( 


10.4.3. Differential Equations with Constant Coefficients 


For real numbers fj, ..., fr, where f,=1, we consider the linear differential 
equation of order x 


2 fy(x)=0 (1) 


8 Triebel, Math. Physics 
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with constant coefficients. It is seen immediately that y(7) =e is a solution of 


Rr 
(1) if and only if 2 is a (generally complex) zero of the polynomial P(A) = mS tine 


0) 
(For the moment, we also admit complex functions as solutions.) Thus it is 
clear that pairwise different zeros 4), ..., 2) with their multiplicities my, ..., mz 
play a decisive role. According to Subsec. 5.3.3. one has 


AQ) = = fal=Q—ayl... QA). 


rea 


Theorem. 
Aya Ayx m,—1 Aye 
Cg eee 
Aga Agx Mg —1 Age 
Go. ME 5 cog CP LE 5 (2) 
AZ A m)—1_ Aza 
Ce re a ae! 


is a (complex) fundamental system for (1). 

Remark. These are n functions, because n=m,+ ... +m. Aceording to Theorem 5.3.3/3, 
i, is likewise a zero of P(A), with multiplicity m, (here we can assume that Im /,+0). 
If, for example, 2,=/,, then it is easy to obtain 2m,=m,+mzy real linearly independent 
solutions of (1) by linear combination of the first two rows of (2). By pairwise combination 
it is thus possible to construct a real fundamental system. On the other hand, (2) is also a 
fundamental system if the coefficients /; are complex (and in this case a real combination is 
neither generally possible nor desirable). 


10.5. Continuous Dependence on Initial Values 


10.5.1. Systems of Differential Equations of First Order 


We consider first-order systems 


oj (x) = hylx, Yy(@), +» Ynl@)), —-Ys(%o) = 6 5 (1) 
j=1,...,n,on [a,b]. Here —©o<a<b<oand ag€[a, b]. The functions hj(xy, ..., tn) 
are assumed to fulfil the conditions of Problem 4,1.2/2. According to Theorem 
4.2.1, the system (1) has a unique solution. The question to be answered is how 
this solution y;(x, c1, ..., Cn), 7= 1, ..., n, depends on the initial values cy, ..., Cn. 


Theorem. There exist a number +0 and a number L = L(6)=>0 such that 


n 
Drei, Ca — yy oe) = ea | 
j=l = 
for all real numbers C,, ..., Cn Ey, +) €m and all x€[xy—6, ro +4) [a, 5] (contin- 
uous dependence on the initial values). 
Remark. This is a local statement. Accordingly, local assumptions concerning the functions 
h;, for example in the sense of Subsee. 4.2.3., will also be sufficient. 


10.5.2. Differential Equations of Order n 


According to Subsec. 10.4.1., differential equations of order xn can be reduced 
to systems of first-order differential equations. Consequently there is also an 
analogue to Theorem 10.5.1 for differential equations of order x. Let us consider 
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the problem 

COC (2), (x)), yPlxo)—e;, (1) 
where j=0, ...,n—1, r€[a, b]. Here, —o<a<b<o and x,€fa, b]. The func- 
(IO) Gehan, Gaeaen)) se oscil’ to fulfil the conditions of Problem 4.1.2/1. Accord- 
ing to teeter meee lg. (1) has a unique solution ¥(r, co, ..., ¢,—1)- 


Theorem. There exist a number d>0 and a number L= L(5)>+0 such that 

n—-1 n—-1 

: Lyx, Cy v*25 Cn —1) — yx, Gp cove Caen =f Ms, ley —é) 

7=0 t=0 
for all real numbers Co, +) €n—15 Ey «5 €n—1 and all x € [xg —4, Xo + 6M [a, 5] (contin- 
uous dependence on initial values). 


Remark. The statement made in Remark 10.5.1 applies analogously. 


10.5.3. Continuous Dependence on the Right-Hand-Side 


We shall restrict ourselves to differential equations of first order. Correspond- 
ing assertions are, however, also true for systems for first-order equations and 
for differential equations of higher order. As in Subsec. 4.2.3., let Q be an open 
domain in the r,y-plane. Let {xp, yo} € 2 and {x9, Fo} EQ. laine ber let h(x, y) and 
h(x, y) be real-valued functions that are continuous and bounded on and fulfil 
the Lipschitz condition 

A(x, y)—A(x, yYNSM ly-y'l, lhe, y)-h@, Ys ly-y' 
for all {x, y}€Q and {z, y’}€Q (ef. Subsec. 4.2.3.). According to Theorem 4.2.3, 
the differential equations 

y' (x) =hlx, y(z)), (ro) = Yo» 
and 

¥ (x)=h(2, ¥(x)), ¥(Xo) = =Yo > 
have unique solutions (this holds at least locally on an interval [x9 —6, x9 +-6]) 
(Fig. 10.10). The problem now is to compare these two solutions. 


Fig. 10.10 
x 


Theorem. There exist a number 6>0 and a number L=1(6)>0 such that 
Wx) —y(x)| Sh (6 sup |A(u, v)—h(w, v)l + lyo— Gol) 
furpen 
for all admissible xo, Yo. Go and xE[xy—4, ry +4). 
Remark 1. “Admissible”? means that the rectangles of Fig. 4.5 in Subsee. 4.2.3. are in- 
eluded in 2. 
Remark 2. The theorems stated in Subsees. 10.5.1. and 10.5.2. as well as the above theorem 


are useful for numerical and physical applications: small perturbations of the initial values 
and of the differential equation will have little effect on the solution. 


Remark 3. Qualitative aspects of ordinary differential equations play an essential role in 
modern mathematics. The books [4, 5, 27] give an introduction into these fields of the theory. 


BF 


88 11. Calculus of Variations MLSS. 


These barbarians measure everything with the 
same yardstick, the theorem as well as the epi- 
gram... May they distrust their senses narcotized 
by the opium of differential and ‘integral calculus. 
(Friedrich IJ, King of Prussia, on Euler and 
@Alembert, 1762) 


11. Caleulus of Variations 


11.1. Fundamental Equations of the Calculus of Variations 
11.1.1. Formulation of the Problems 


To illustrate the problems to be solved, let us consider a simple case that will be 
generalized later on. Let L(t, u;, v;) be a real-valued function that is continuous 
on [a, b] x Ry. For a function a(t) which is continuously differentiable on [a, b], 
one may consider the integral 

b . 
: dx 
L(x(t)) = | L(t, w(t), &O) de where &()=— (0 - (1) 
a 
The problem is to find functions continuously differentiable on [a, 6], such that 
L(x(t)) attains an extremum. 


Ist Problem (variational problems with free boundary values). Admitted to compe- 
tition are all functions a(t) that are continuously differentiable on [a, bd). 


2nd Problem (variational problems with fixed boundary values). Admitted to com- 
petition are only those functions continuously differentiable on [a, b] which 
assume given boundary values at a and 6b: x(a)=¢q and x(b) =e» (Fig. 11.1). 


gaa Cy 


Fig. 11.1 
a b 
Neighbouring curves. We shall now describe the method by which extremal cur- 
ves, or extremals, are determined. For a given z(t), let 


b 
L(y, €) mee L(t, x(t) + ep(t), &(4) + eP(t)) de, 


where g(t) is an arbitrary function which is continuously differentiable on [a, b] 
(with y(«)=(b)=0 in the case of the second problem). « is a real parameter. 


: eed 
x(t) is called extremal if aE T(g, €) le=o =0 for all admissible y(t). This does not 


quite correspond to the usual approach in the calculus of variations, but will 
suffice for our purposes. In this way, variational problems for functions have 
been reduced to the problem of finding extreme values of functions; cf. Sec. 8.3. 
In Sec. 8.3., points had been varied in order to determine extreme values of func- 
tions. Now it is functions that are varied. 
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11.1.2. Preparatory Considerations 


We need two preparatory lemmas, which are also of interest on their own: one 
concerning the differentiation of parameter-dependent integrals, and the funda- 
mental lemma of the calculus of variations. 


Lemma 1. Let L(t, 4) be @ real-valued function that is continuous on [a,b] x Ry, 
with —co<a<b<eo, and differentiable with respect to 2, with t€[a, b] being fixed. 


Oe 
Let Aa L{t, 4) be continuous on fa, b]x Ry. If a(a) and b(A) are two real-valued 
functions being continuously differentiable on R,, with a<a(a)<b(A)<b, then 


b(A) 
f L(t, 4) dt is continuously differentiable on Ry, and 


oo b(A) b(A) 
ee VSIA We an eNdeee |) eoene ot 
aa! L(t, 4) da= L(b(A), )—L(a(a), wns | aed 

a a 


Lemma 2 (fundamental lemma of the calculus of variations). Let y(t) be continuous 
on [a, b], and let 
b 
Sf y(t) ¢(t) dt=0 
a 
for all functions y(t) that are continuously differentiable on [a,b] and vanish in a 


neighbourhood on the right of a as well as in a neighbourhood on the left of b, (Fig. 
Ie )perhenmy (Dron [arb]. 


11.1.3. The Euler-Lagrange Equations 


Definition. Suppose that L(t, wy, «0.5 Un, V4y «+> Un) ts real-valued and continuous on 
[a, 6] X Ron, with —0<a<b <, where tt has continuous partial derivatives of first 
and second order with respect to all of its variables. If e is a real number, and if 
X(t), --, Xn(C), Py(t), --» Pn(t) are real-valued functions that ure continuously differ- 
entiable on [a, 6], then let 


L(G og ny ©) 


b 
= Lf Lt, x(t) + ep,(t), Co) Xn(t) + €Gn(f), 4 4(0) + ef (¢), C002 En(é) + €Pn(t)) dt. 


(a) x(¢) =(x4(0), -..) 2n(0)) ts called extremal of the variational problem with free 
boundary values tf 


d 
Gens re) eae (1) 
de 


for all p(t) =(94(0), .-, Gn(t)) as defined above. 

(b) x(t) as called extremal of the variational problem with fixed boundary values 
if (1) holds for all of the above p(t) that satisfy pj(a)=9;)(b) =9 for j= 1, «.., n. 
Remark 1. This is a generalization (and more precise definition) of the situation described 
in Subsec. 11.1.1. 
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Theorem 1 (Euler-Lagrange equations). x(t) 7s an extremal of the variational prob- 
lem with fixed boundary values if and only if 


ay dad jel 

== (—-|=0 fer f=1,.257 ard Vicia, Ol- ae 

Oxj dt (Fn) om (5) 2) 
Remark 2. The equations (2) are to be understood as follows. The functions 

oL ob OL OL 

=== : nee pS Mn nrg =9,(t 

Ou; Ou; (t, x it). a nit)) Ox; Ov; (t. x i(t) er) gj ) 


depend on t€[a. 6] alone. Here uw; and v; have the same meaning as in the above definition. 
So one first has to take the partial derivatives in (2), then insert x(f). and thereafter form 


d : d fal 
ae =O Naa) 


Theorem 2. x(¢) 7s an extremal of the variational problem with free boundary values 
tf (2) holds and, in addition, 

oL , OL ‘ 

aa; (b, x4(d), SOS) 0) ~ Ba; (a. 2): SoD A) = (3) 


peveatis/ved fOr p= Vy wey 1 


Remark 3. In the case of the variational problem with free boundary values one automati- 
cally gets certain boundary conditions for an extremal (so-called natural boundary condi- 
tions). 


Remark 4+. Taking the derivative with respect to / in (2), one obtains 


CL) En i 
= m= 2, SSS = 
Ox; Olde; <1 Oxide; | Ah aaa; © 
alee 
Ge lle osoy fo lakene ee This is a system of ordinary differential equations of 
dé? 


second order, which is generally non-linear, since L=L(t, x(t), ..., 2n(t)). Using the same 

trick as in, say, Subsec. 10.4.1., one can reduce this system to a system of first order. Tf 
Or L ae oe 

ea is different from zero. then it is possible to resolve (4) for 

0% 10%; 1,7 =1 

the highest-order derivatives. 


the determinant of ( 


11.2. Examples 
11.2.1. A Preliminary Physical Remark 


The considerations presented in Subsec. 11.1.3. are of great importance for 
theoretical physics. One attempts to describe physical processes (c.g. in classical 
mechanics, electrodynamics, or in the special or general theory of relativity) by 
variational principles. Such a physical situation is assigned a Lagrangian den- 
sity, which corresponds closely the function / of Def. 11.1.3. The physical pro- 
cess in question is then found to take place so that L(x, (/)),asdefined in (ola), 
assumes an extreme value. If L coincides with the function of Def. 11.1.3, then 
this means that the process is described by the Euler-Lagrange equations 
(11.1.3/2). Many physical processes are described by partial differential equa- 
tions rather than by ordinary ones. If one wishes to stick to the concept of 
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variational principles, the considerations of Sec. 11.1. must be substantially 
generalized. In Chap. 24 we will investigate more closely the mathematical and 
physical aspects of this principle. For classical point mechanics, however, the 
results of Sec. 11.1. are fully sufficient. A more detailed description, especially of 
the ical aspect of this matter, will be given at the beginning of Chap. 12. 
In See. 11.2. we consider a few examples w hich will, however give an impression 
of the physical importance of variational patnetnelten. 


11.2.2. The Brachistoehrone 


Consider a mass point (e.g., a steel ball) that moves under the action of gravity 
on a certain path line (e.g., a rail) from A to B (Fig. 11.3). Suppose that the 
motion takes place in a plane perpendicular to the earth’s surface. Further sup- 
pose that initially the ball is at rest at the point A. For which path (or paths) 
will the ball require minimum time to get from A to B? If the ball moves in the 
straight line from A to B (broken line), then this is the shortest path indeed, but 
it will take some time for the ball to accelerate. If, on the other hand, a path like 
that indicated by the dash-dot line is chosen, then the ball will rapidly reach high 
velocities, but the path has now become comparatively long. The physical com- 
promise might be the path indicated by the solid line. Let A be the origin of a 
y,c-co-ordinate system (Fig. 11.4). Let y(x) be the path of the ball, i.e., ¥(0)=0, 
and let v=v(x) be its velocity. If m is the mass of the ball, then its total energy 
at the point P is given by 


m cee: : 
E =— v?— mgy =kinetic energy + potential energy . 
4A 

- ot? 

€ SG 

SES 

Ss 
(ieee os Cea Fig. 11.4 


m : , 
Here g denotes the acceleration of gravity, g~9.81 = Using the physical fact 


that E(x) =E is a constant independent of x a ss If a law), and the 


initial conditions v(0)=y(0)=0, one obtains v(x) = V2gy(x). If s(x) denotes the 
length of the aus y(x) from A to P, then in the light u he 023. one che 
tains dy 


s(x) zi Veale de. yy ee 
0 


If tis the time required for the ball to move from A to P, then ¢ is a strictly in- 


ds | 
creasing function, and for the inverse function s =s(¢) one has v(é) he Then; bs 


Theorem 3.1.4, —- If 7 is the time required for the ball to move from A to B 
Sv 
on the path y(x), one now obtains 
raed a Poy Le yay 
: ie ds i 7 Soe iy UE Nee 
—_ ={ —-—dr=] —Ysl a — fa 
Ih | dt | AG Ae | ; Vity’? dx | oFn) 


0 0) ) 0 
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Thus the problem of minimizing 7 is an extreme value problem in the sense of 
Def. 11.1.3, where 
1+y2 


Le, y y= (1) 


Paths for which 7 attains a minimum are called brachistochrones. The petty 
difficulty that L becomes singular for y=0 can be avoided. For that purpose it is 
convenient to consider the curve in a parametric representation, y=y(w) and 
C= Til): 


Theorem. Zhe brachistochrones are cyclowds, 
a(w)=clu—sin wu), y(w)=c(1—cos u) (2) 
where 0S u=2n, and c>0 ts a parameter. 


Remark 1. The curve (2) is obtained by solving the Euler-Lagrange equation (11.1.3/2). 
Here »=1, and one has to substitute x for ¢ and y for x. One obtains an ordinary differential 
equation of second order, where it has been taken into account in (2) that the only solutions 
of interest are those passing through A. x(u) is strictly monotonic, and hence there is an 
inverse function, and (2) can be represented as y=y(x). By variation of ¢ one obtains a 
schlicht and gapless covering of the whole quadrant {z>0, y>0}. Hence, if B is given, there 
exists exactly one c >0 such that (2) passes through A and B (Fig. 11.5). 


Remark 2. The cycloid allows of the following nice geometric interpretation. Suppose a 
wheel is rolling on a road. Find the trajectory of the valve of this wheel (Fig. 11.6). If ¢ is 
the radius of the wheel, then Figs. 11.7 and 11.8 show that the cycloid (2) describes the 
trajectory of the valve. 


wheel 


Fig. 11.5 valve road 


p Se ee 
Fig. 11.7 VA Nae Fig. 11.8 


Fil O KC 2KC 
Px (Cu~-¢ Sinu, C-C cosu) 


11.2.3. The Problem of the Straight Line as the Shortest Line Connecting Two 
Points 


(This is the heading of the 4th problem of the famous lecture by D. Hilbert on 
“Mathematische Probleme’, Paris, 1900 [85].) It deals with curves a(¢) =(x,(¢), ..., 
x(t), 97 =t=t,, which are continuously differentiable in R, and pass through two 
given points A and B, with z(t) =A and a(t,)=B. It is further supposed that 
2@)=(4,0), .-4 tn) +0 lor ip =t=f. Accordine to Theorem 9.2.2, 

t 


L=f ¥#0)+..4+ #21) at (1) 
by 
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is the length of the curve from 4 to B. What is desired are curves of minimum 
length. 


Theorem. The straight line from A to B is the unique curve of minimum length be- 
tween these two-points. 

Remark 1. This is a variational problem as defined in Subsec. 11.1.3., with [Bing coo Op) = 

ie en ee : ; : : : 

= Vr4 +... +0,. As regards the condition that L be twice differentiable, there is a minor 
difficulty at 0. which can, however, be avoided by choosing the parametcr to be, say, 
t=a,, provided that 4 and B have different co-ordinate values x, (the total number of 
comparable curves will then be somewhat restricted, but this is of no consequence). If 


the above formulation is retained, the solution of the differential equation (11.1.3/2) gives 
the desired result. 


Remark 2. We will study problems of this kind in detail later on in the fields of differential 
geometry and general relativity. Thus one may look for curves that lie in a given surface 
and pass through two given points. Considerations of this kind are necessary, for example, 
in calculating the paths of light rays in four-dimensional curved space-times by the general 
theory of relativity. 


11.2.4. Rotation-Symmetric Minimal Surfaces 


Take a closed wire sling and dip it into soap solution. After removing it from 
the solution, you may have the luck to find a thin liquid film spanning the sling; 
it realizes a surface of minimum area that passes through the sling. In other 
words, one considers two-dimensional surfaces that pass through a given closed 
curve in #,. Desired are surfaces of minimum area in the sense of Subsec. 9.2.4. 
This is Plateau’s problem; the corresponding surfaces are called minimal sur- 
faces. The general theory of these surfaces is relatively complex, but a special case 
is within our reach: given two points A and B in the upper half-plane. Let y(x) be 
a curve in the upper half-plane which is twice continuously differentiable and 
connects these two points. If this curve is allowed to rotate about the x-axis, it 
generates a two-dimensional surface of revolution in #, (Fig. 11.9). 


Lemma. The area of the surface of revolution just described vs 
eal ————— 
L=2r f y(x) ¥1+y%(x) dz. 
2% 


Remark 1. What is now desired are curves y(x) that minimize the area L of this surface. 
We treat this question as a variational problem in the sense of Def. 11.1.3, with L(u, v) = 
2au V1+02 Solving the associated Euler-Lagrange differential equation (11.1.3/2), one 
obtains the following result. 

Theorem. Jf a is a positive parameter and b a real one, then 


ea? (1) 
a 


y(x) =a cosh 
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is the generating line of a rotation-symmetric minimal surface. Here cosh denotes 
the hyperbolic cosine defined in Subsec. 7.1.4. 


Remark 2. The generated rotation-symmetric minimal surface is called catenoid. We shall 
not discuss here how @ and b must be chosen (provided that this is possible) in order to make 
the curve (1) pass through the given points A and B. 


... the task is suggested to us ...to treat those 
physieal disciplines axiomatically in which already 
today mathematics plays a prominent role; above 
all, these fields are the theory of probability and 
mechanics. 

(D. Hilbert, ‘‘Mathematische Probleme’, Paris, 
1900, [85]) 


12. Principles of Classical Mechanics 


12.1. Modelling in Physics 
12.1.1. On the Relationship between Mathematics and Physies 


When we speak of physics here, we always mean theoretical physics. A mathe- 
matician takes a different view of physics than does a physicist. The latter is 
interested in including experimental facts into existing theories, in the (qualita- 
tive and quantitative) prediction of new effects, and in the development of new 
theories if the old ones do no longer perform satisfactorily. For him, mathematics 
is a tool. (In circles of well-meaning physicists, they like to compare physics to 
a queen served by mathematics in a double role: bearing the trail behind and 
the torch ahead.) New theories (or new propositions within existing theories) 
must carry conviction, the mathematics used must be credible. Here it is quite 
customary (and perhaps even necessary, if one breaks new ground) that mathe- 
matical deduction and physical argumentation go hand in hand. This presuppo- 
ses a sure physical feeling. In the history of physics there are many very impres- 
sive examples of this. Perhaps a mathematician would feel reminded of the 
instinetive certainty with which once Euler handled divergent infinite series. 
It is just this physical feeling that even physically interested mathematicians 
(and other ones we are not speaking of here) are often lacking in. For a mathema- 
tician, the mathematics used must be not only credible but definitely logically 
founded: and this makes an enormous difference. Further, he will try to separate 
mathematics carefully from its physical interpretation. This leads to axiomaties 
and modelling. The heart of such an axiomatic approach is a mathematical 
theory which, per se, has nothing to do with physics. Intereonnecting mechanisms 
and interpretations are required to translate the physical problems into the 
mathematical language used. The situation resembles that of a computer which, 
taken separately, is capable of functioning and has its internal logics, but is of no 
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use until a whole set of input and output units have been connected. By their 
experimental colleagues, theoretical physicists arc often cross-examined about 
the sense or use of their theories. They take their revenge for that on the mathe- 
maticians who deal with the axiomatics of physical Hiconines: The question 
they ask is: “What will this bring about physically?” There arc two possible an- 
swers: (1) nothing, and (2) eee The first answer has been given from the 
questioner’s point of view, for axiomatics cannot make any contribution to 
those problems he is interested in (see above): it does not create any physically 
new theories, nor explain any uew effects. After all, axiomatics seems to be pos- 
sible (or even desirable?) not until the physical discipline in question has devel- 
oped to a relatively advanced statc. (Perhaps like a black hole as the decline of 
life of a one-time shining star? This would also account for the magic attraction.) 
The second answer hits the gist of the matter: axiomatics creates mathematical 
certainty. !) It is hke the construction of wide highways that open up a wonderful 
physical landscape for mathematical tourism. (After a period of modesty, the 
physical footsloggers will also tramp along these ways.) Fortification of the base 
is of advantage also for those pioneers who cut their way with the cutlass through 
the jungle at the frontline of physics. In this sense, the axiomatics of physical 
disciplines is not only mathematically desirable, but also physically necessary. 


Remark. We quote a few relevant opinions of outstanding physicists and mathematicians. 
The Nobel laureate 8S. Weinberg writes in [71]: ‘““Physies is not a finished logical system. 
Rather, at any moment it spans a great confusion of ideas, some that survive like folk 
epics from the heroie periods of the past, and others that arise like utopian novels from our 
dim premonitions of a future grand synthesis”. In R. K. Sachs, H. Wu [53], one finds the 
following sentences: ‘‘Physical theories are guessed, not deduced; if only deduction were 
required, every competent hack could be an Einstein or a Feynman ... No wonder a 
mathematician, accustomed to regarding a mathematical framework as primary, finds the 
plethora of alternatives confusing. The physics view is that nature is primary, and that even 
the most careful, mathematically rigorous mathematical models are «t best approximations 
to nature. So the physicist finds nothing odd in having to switch models occasionally, and 
indeed revels in the intuitive arguments needed to decide which model is least inaccurate 
physically’. The Nobel laureate E. Segré writes in [57]: “Mathematics is the natural lan- 
guage of physics, as Galileo pointed out, and altough Volta and Faraday wrote great 
physics without using a formal mathematical language, they thought mathematically, 
and their ignorance of standard mathematics makes them less, not more, mtelligible”’. 

In “Physics and Philosophy” the Nobel laureate W. Heisenberg remarks: “The conception 
of objective reality ... evaporated into the ... mathematics that represents no longer the 
behaviour of elementary particles but rather our knowledge of this behaviour”. In “The 
Character of Physical Law” the Nobel laureate R. Feynman writes: “Mathematicians 
like to make their reasoning as general as possible. If you say to them ‘I want to talk about 
ordinary three dimensional space’ they say ‘If you have a space of n dimensions, then here 
are the theorems’. “But I only want the case 3°.‘ Well substitute n=3’ ...So a certain 
amount of reducing is necessary, because the mathematicians have prepared these things 
for a wide range of problems. This is very useful, and later on it always turns out that the 
poor physicist has the come back and say, ‘Excuse me, when you wanted to tell me about 
four dimensions ... . The mathematical rigour of great precision is not very useful in 


physiecs”’. 


1) Tt was noted by Bertrand Russell that the axiomatic method has many advantages, 
similar to the advantages of shirking honest work. 
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Moreover, it is an error to believe that rigor 
of argumentation would be adverse to simplicity. 
(D. Hilbert, “‘Mathematische Probleme’, Paris, 
1900 [85]) 


12.1.2. Mathematical Models 


A mathematical model for a physical discipline can be subdivided into five 
components (Fig. 12.1). 


moth. CED i ie 
theory phys. reality 
inter— 
pretation 
comporison Fig. 12.1 


1. A given field of physical reality, such as mechanics, electrodynamics, ther- 
modynamics, relativity theory, quantum mechanics etc. What is meant here is 
the whole set of physical processes classified under the respective keyword. This 
is not a very precise description, but in concrete cases the meaning will be clear. 

2. An intramathematical theory which (as far as its internal logical structures 
are concerned) has nothing to do with physics. 

3. A rule of translation that relates physical quantities to corresponding 
mathematical ones. 

4, When translated, the theory yields data (real numbers) that must be inter- 
preted as physical quantities. 

5, Finally, the theoretically obtained, interpreted data must be compared 
with corresponding experimental material. If the result of this comparison is not 
fully satisfactory, this means a criticism of the translation, the interpretation, 
and of the expediency of using the particular mathematical theory concerned. 
It is not, however, a criticism of the internal structure of this theory, which can 
at best be criticized from an intramathematical, but not from a physical point 
of view. 


Physical laws should have matheinatical beauty 
(P. A. M. Dirac) 


12.1.3. Criteria for Models 


1, A model is good if it works. In this sense there are no true or false models 
but only suitable and unsuitable ones. Here the term “suitable”? means that, in 
the sense of Subsec. 12.1.2., all effects of a defined field of physical reality can be . 
described within a given accuracy of measurement. Consequently a model is 
characterized not only by the subject it describes but also by the accuracy of 
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this description. To give an example, classical pomt mechanics is an excellent 
theory, provided that the velocities involved are small compared with the veloc- 
ity of light. It is only for high velocities and high accuracies that it must be 
replaced by the special theory of relativity. 

2. Mathematically, a model should be as simple as possible without making 
concessions with respect to mathematical rigor. Hence there must be substantial 
physical arguments in order that a theory be replaced by a new, more complex 
one. At the time of Newton, special relativity (in competition with classical 
mechanics) would have violated this postulate. 

3. There is a hierarchy of theories and associated models. A new theory covers 
a larger field of reality with a higher accuracy than did the corresponding old 
theory. History shows that the inventory of mathematical means used in these 
theories becomes more and more complex. Very likely the full potential of a 
theory can be surveyed not until the next better theory is made out in outlines. 
During the development of point mechanics, our ancestors in physics believed 
that it would give a definite description of everything that happens in the world. 
They had been dazzled by the glory of this theory, and hugely overestimated its 
potential (hindsight is easier than foresight !)). At least the physicists have not 
made this mistake a second time: theories and models always give only approxi- 
mative descriptions. 

4. An essential justification of dealing with theories and models lies in the 
ability of the latter ones to predict new effects. There are impressive examples 
of this ability: Einstein’s prediction that light rays are deflected by heavy mas- 
ses, or Dirac’s prediction of the existence of positrons. Formulas are indeed wiser 
than men.~2) 


Remark. We quote a few sentences from R. Feynman, The Character of Physical Law: “The 
moons of Jupiter appeared to get sometimes eight minutes ahead of time and sometimes 
eight minutes behind time, where the time is the calculated value according to Newton’s 
Law ... O. Roemer (1644-1710), having confidence in the Law of Gravitation, came to the 
interesting conclusion that it takes light some time to travel from the moons of Jupiter to 
the earth,... In this way he was able to determine the velocity of light. This was the first 
demonstration that light was not an instantaneously propagating material. I bring this 
particular matter to your attention because it illustrates that when a law is right it can be 
used to find another one. [f we have confidence in a law, then if something appears to be 
wrong it can suggest to us another phenomenon’. 


12.1.4. An Example 


In this book we shall deal with several physical theories, following closely the 
model thinking described above. To illustrate this, let us consider here a simple 
example. 


') “Given for one instant an intelligence which could comprehend all the forces of nature 
and the respective situation of the things that compose it... for it nothing would be uncer- 
tain and the future, as the past, would be present to its cyes” (Laplace, Philosophical Kssay 
on Probabilities, 1812—1820). 

2) Heinrich Hertz: “One cannot escape the feeling that these mathematical formulas have 
an independent existence and an intelligence of their own, that they are wiser than we are, 
wiser even than their discoverers, that we got morc out of them than was originally put into 
them”. 
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General scheme | Application to mechanics Special example (12.2.2.) 
1. field of | classical point falling 
physical mechanies stone 
reality 
2. mathema- extremal principle, oL doL 
tical Eulerian equations indi oe 
theory 


P= & a= S24 ottey 


3. translation Lagrangian function 


m 
. L(t, 2, )= > 2+ mex 
L(t, 23, 2) (f, &, #) 5 8? +g 
4. inter- a,(t) trajectories of a(t): way 
pretation particles 
5. comparison _ Galilei on the leaning 


| tower of Pisa 


eer The Model for Point Mechanics 
12.2.1. Hamilton’s Principle 


The field of physical reality to be considered here is a system of m mass points 
moving in the three-dimensional Euclidean space. These mass points are acted 
upon by internal and external forces. To give an example, gravitational forces 
are internal, whereas electromagnetic fields acting upon charged particles are 
external forces. Suppose that the position of the j-th particle at the time ¢ is 
(233 -9(¢), 3; -1(0), v3;(¢)) €. Ry. Hence the total system is described at the time ¢ 
by a point (2,(é), ..., tan(¢)) € Ran, with the interpretation given above. To every 
system there is associated a Lagrangian function L(t, x,(t), #;(¢)), which must be 
determined by means of physical considerations. 


Axiom (Hamilton’s principle). A system of n particles with the Lagrangian func- 
hon Lit, 70), «., Zan(t), Bi(0), «+ Lani) moves so that =(240), --; Zant) 1s an 
extremal of the associated variational problem with fixed boundary values. 


Remark 1. This in effect determines the cycle pointed out in Subsec. 12.1.2. The mathe- 
matical theory is the calculus of variations referred to in Subsec. 11.1.3. Translation is 
accomplished by finding a suitable Lagrangian, L(t, #1, .... @3n, 44, .... £3n). The axiom and 
the theory of Subsce. 11.1.3. lead to the Euler-Lagrange equations (11.1.3/2) (here it is 
always assumed that J has the necessary differentiability properties). Their solution gives 
trajectories (x,(t), .... @3n(¢)) in Rz,. The interpretation then identifies (%3j;—-9(t), w3j—1 (4), 
a3,(t)) as the position of the j-th particle in the physical three-dimensional space. 


Theorem. The equations of motion of a system of n particles with the (twice contin- 
uously differentiable) Lagrangian function. L(t, x1, .., jn, Py, +, Lyn) are the 
HKuler-Lagrange equations 
aed fay, 
— |= 0. i — ee ere (1) 
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Remark 2. The theorem is an immediate consequence of Theorem 11.1 .3/1. The equations (1) 
are also ealled Lagrange’s equations of the second kind. The above considerations show that 
the problem is reduced to two questions: (i) determination of L on the basis of pysical prin- 
ciples, and (ii) solution of the system of differential equations (1). If the determinant of 
92 \n 
(= OE is always different from zero, then (1) can be resolved for %), ashas been stated 
1OXK JIL k=1 

in Remark 11.1.3/4. The theorems of Chap. 4 then show that (1) has a local unique solution 
which assumes given values (fo) =a; and £;(f9) =b;, j=1, ..., 3n, at a given time fy (positions 
and veloeities at fo). 


12.2.2, An Example (Free Fall) 


Let us explain the example of Subsec. 12.1.4. more closely. We consider the 
free fall with x=0 for :=0. Let 


L(t, x(t), £(2)) = 24+ mgx= T—V 
= kin. energy — pot. energy . 


Here m is the mass, g is the acceleration of gravity, and V = — mgz is the poten- 
tial energy. (12.2.1/1) then reads ¥=g, giving x(t) = @+ct+c,. From x(0)=0 
and ¢,;=2(0)=v, (initial velocity) one obtatns z(t) -& (2+ vo. 


12.2.3. The First Integral of Motion 


Theorem. /f the Lagrangian function 
Lit, Xi, o2+> U3n5 ce sees Dy as seey L3n5 ae eeey L3n) 
does not explicitly depend on t, then for every trajectory 
3n OL 


= — %,=E=const (first integral of motion). (1) 
R21 OlE 


Remark. Substituting a solution of (12.2.1/1) into the left-hand side of (1), one obtains a 
function of ¢. The theorem says that this function of f is constant. (1) is interpreted as an 
energy conservation law. The example of Subsec. 12.2.2. yields (7'+ V)(t) =E. 


...the felicitous elucidation, dedueed from 
this hypothesis, of most of the phenomena of 
nature furnishes sufficient evidence that it be 
absolutely true. 

(Euler on Newton's law of gravitation, 1760) 


12.3. Systems of n Mass Points 
12.3.1. The Basic Model 


Consider » point-shapcd particles of masses m7, ..., my, in the three-dimensional 
physical space (e.g. celestial bodies). Suppose that 


eV (a 2) — Con) ay 4 — Dap 1) + (x93 — Zan)? 
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be the distance between the j-th and the k-th particle (Fig. 12.2). Here the co- 
ordinates are numbered as stipulated in Subsec. 12.2.1. The gravitational 
potential of the system is 


: 1 MjM}, 
eV olen S005 t3n)= —= ¥ 5) 
2 5ek Ti 


where y is the constant of gravitation.!) Further suppose that an external 
potential V,=V,(t, x), ..-, Yan, £y «-, Zn) 18 given (e.g. an electromagnetic 
potential). On the basis of physical experience and physical principles, which 
are exterior to mathematical axiomatics, the following set-up is made for the 
Lagrangian function: 
n 
Et 4 ony Bn) = 2, “S(O -a + Bean + i) =P (1) 


= T— V =kin. energy — pot. energy 


where ) =V,+ V, is the potential energy. The trajectories of this system are 
solutions of (12.2.1/1). 


Theorem. Jf V=V(t, xy, ..., X3n) ts independent of xp, then the trajectories of the 
system are solutions of 


Mpa 34 —r(t) = — = Ap EM an m Cae Pa), Il, 2c (2) 


3k 
Remark. This follows immediately from (12.2.1/1) and the above set-up for L. The inter- 
pretation shows that %,=(23,~9, @34—1, €3,) is the position of the &-th particle, and that 
av ov oV 
Oe3r-2°  O&3x 4° OX3x 
ing upon the /-th particle. Then formula (2) above is Newton’s second law of motion: 


(bape %3,-1, €3y) 18 its velocity. R,=(— is the force act- 


force = mass < acceleration. 


12.3.2. Foree-Free Systems 


The system of Subsec. 12.3.1. is called force-free if V(t, x, ..., %3,)=0 in 
Kq. (12.3.1/1). In other words: the internal forces are neglected, V,)=0, and 
external forces are absent, V,=0. 


Theorem. The trajectories of a force-free system are 
Rit) = GE 4 Ue — ee : (1) 


where the a; and bz, are real numbers. 


1) “This law has been called ‘the greatest generalization achieved by a human mind’... 
Therefore our main concentration will not be on how clever we are to have found it all out, 
but on how clever nature is to pay attention to it” (R. Feynman, The Character of Physical 
Law, with respect to Newton’s law of gravitation). 
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Remark. The theorem follows from (12.3.1/2). According to the interpretation in Subsec. 

12.2.1., the k-th particle moves in the straight line x(t) =d,t A where @,=(d34-9, @3¢-15 

dx(t) 
dt 


a3,) and b, =(bap_9. b3~—1. bg). The velocity of the particle is =a. So the particles 


in a force-free system move uniformly in a straight line. 


Historical uote. We quote the famous three “Axioms, or Laws of Motion” of 
Ie ewtonn(los6), ct. (46) p. 13): 

Lex prima (Law I). Every body continues in its state of rest, or of uniform 
motion in a right line, unless it is compelled to change that state by forces 
impressed upon it. 

Lex secunda (Law II). The change of motion is proportional to the motive force 
impressed; and is made in the direction of the right line in which that force 
1s impressed. 

Lex tertia (Law III). To every action there is always opposed an equal reaction: 
or, the mutual actions of two bodies upon each other are always equal, and 
directed to contrary parts. 


12.3.3. Conservative Systems 


The system of Subsec. 12.3.1. is called conservative if V(¢, x, ..., tan) = 
V (x4, -.-) %3n) of Eq. (12.3.1/1) is independent of ¢, %,,..., Gan. The kinetic energy T 
has the same meaning as in that formula. 


Theorem. For the trajectories of a conservative system, 


3n 

~ Mk , 29 oD -9 = 

> 5 (F3,-2 + Har_-1 + £3) + Vey «5 Lan) = T+ V=E= const 
k=1 4 


(energy conservation law). 


Remark. The theorem follows from Theorem 12.2.3. So the total energy of a conservative 
system remains constant in time. 


12.3.4. Particle in a Potential Well, Harmonic Oscillator 


Consider a particle of mass m that moves in Ry. The particle is acted upon by a 
force K(x) = —1V’’(x), x€ Ry, which is generated bya potential V(x). According to 
m 
z 
function. This is a one-dimensional conservative system as defined in Subsec. 
12.3.3., and one obtains 


Subsec. 12.3.1. one has L(t, 2, #)= . @2—V(x) as the associated Lagrangian 


m 
2 
for the trajectory x(t) of the particle. Eis a characteristic quantity of the system 
(energy), and (1) gives 
2 
d2=— (E—V(z)). (2) 


m 


eS (a) = 1 const (1) 
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ya 


This implies that V(x) =E. Hence the particle cannot leave the potential well 
(Fig. 12.3). Integration of (2) gives 


m if : 
trie 2/" | —— = (3) 


Tf (without loss of rare the positive sign is chosen in (3), then this means 
that the particle moves from ag to the right. The question is whether the particle 
will arrive at the point x, within a finite time. 


Theorem 1. Let V (a) be twice continuously differentiable on Ry. 
(a) If V’(a,)+0, then the particle arrives at x,=a(t,) after a finite time, and 


ie. yz | | ” (convergent improper integral) 

—f,= —_ See NVE} GP ; ; ; 

i= 0 aa 

(b) Lf V’(a,)=0, then the particle does not arrive at x, (creeping approach), and 


(divergent improper integral). 


V(x) 


Fig. 12.4 


Harmonie oscillator. The special case of the harmonic oscillator, where V(x)= 
kt : : , 
=5 v4(Fig. 12.4), isof particular interest. The force A(x) = — kx actson the particle 


to pull it towards the origin from every nonzero position (restoring force). 
An elongated steel spring attached to the origin is a technical realization of such 
a restoring force. 


Theorem 2. If x(0)=0 (initial position) and x'(0)= v9 (initial velocity), then the 
harmonie oscillator oscillates about the origin: 


x(t) =U = sin ain ae (4) 
Remark. (4) is the solution of the equation of mction m% = —ka (cf. Rq. (12.3. i/2)), taking 


into account the above initial conditions. According to (4), %max=Vo Ve = is the maximum 


m. 
deflection and T=27 12 is the period of the oscillation. 
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Look to the Heavens, and learn from them 
How one should really honor the Master. 

The stars in their courses extol Newton’s laws — 
Tn silence eternal. 

(A. Einstein on the three-hundredth anniversary of 
I. Newton’s birth, 1942) 


12.4. Planetary Motion 


12.4.1. Formulation of the Problem and Basie Model * 


Kepler’s laws found empirically for the planets’ motion about the sun read as 
follows. 
Kepler’s Ist law: The orbit of a planet is an ellipse that lies in a fixed plane. 
The position of the sun is in one of the two focuses. 
Kepler’s 2nd Jaw (principle of conservation of areas): In equal periods, the 
radius vector drawn from the sun to the planet describes equal areas. Let Fy 
and fF, be the areas of the shaded (curvilinear) triangles (Fig. 12.5). If t,;—¢,= 
i,—ts, then #',= Fy. Here the ¢; are the times at which the planet passes through 
the respective positions. 
Kepler’s 3rd law: Let a, be the major semi-axis of the ellipse, and let 7, be the 
period of revolution of a first planet. Let a2 and 7’) be the corresponding para- 


pia eys 
a 
meters of a second planet. Then Ae =-, : 
oS 
Planet 
7 r 
0 
sun 
Fig. 12.5 Fig. 12.6 


The basic model. It is our aim to deduce Kepler’s laws from the previous consi- 
derations. The problem is treated as a one-body problem. The sun, with the mass 
M, is fixed to the origin of the fs, and the celestial body of mass m moves in 
an orbit (x(¢), y(¢), 2(¢)) about the sun (Fig. 12.6). tis the time. If r()=Vr24 y24+22 
is the distance from the origin,we make the following set-up for the Lagrangian 
function: 

(1) 


Bane WP. : er Mim 
TAC 2 es 2) 5 (42+ 92+22) +7 me 


Mm 


Here — y is the gravitational potential, and y is the constant of gravitation. 


' 
This agrees with the set-up of Subsec. 12.3.1., except that one mass point, namely 
the sun, is regarded as being fixed: the physical justification of this is that the 
mass of the sun is much greater than the mass of all of the planets, so that the 
influence the planets have on the sun can be neglected. 
Polar co-ordinates. We use the spherical polar co-ordinates known from ana- 
lytical geometry (Fig. 12.7): 

reine cos g, O27 -< 67, 

y=rsindsing, O<d<nx, 

zZ=r cos ?, 0Sp—2n . 


Q* 
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The singularities at r=0, 0-0 and @--x will not disturb our considerations. 
Transformation of (1) tor, 0, and @ gives 


, m.. a ne Mim . 
Lb, &,Y,2, t, 9, A) =~ (P +02 + PG? sin? 8) + Y , 


= Lit, YT, a; P> % b, Pp) . 
According to the axiom of Subsec. 12.2.1., the desired trajectory is an ex- 
tremal of 


tf i fi 
f LU, 245% 4, 9,8 dt=f Lit, r, 9, 9, 7, 9, ) at. 
to to. 


As the value of the integral is invariant under the transformation from Cartesian 
to spherical polar co-ordinates, the extremal of the transformed integral is 
identical with the original one. Thus, according to Theorem 11.1.3/1, the desired 


trajectory is a solution of the Euler-Lagrange equations associated to L of 
formula (2) above: 


ab d fab Re Lee Mm d,s. ant 
— Fray (gg) =r Osim De uy 8) 
ab d fab d 
Sa 2p sin? O 
ap dt a ca Gey 6 
eb a (ab ! Cl ese 
Say, (55) = mee sin ? cos 8 —m Al (72) . (5) 
| 
| 
7 
pa) 
(ee 


Fig. 12.8 


Initial values: Suppose that at ¢=0 the celestial body has its shortest distance 
froin the sun (at least compared with ¢¢€(—6, 6), 6+0). This only excludes the 
case that the celestial body falls into the sun. Let a(t) =(#(d), G(d), 2(¢)) be the 
velocity vector tangential to the trajectory, then we choose the 2,y,z-co- 
ordinate system so that 4(0)=2(0) =2(0) =0 (Fig. 12.8). In spherical polar co- 
ordinates this gives 


H0)=5, 9(0)=0, 1(0)=ry=0, (6) 


H(0)=0, G(0)=v, *(0)=0. 
The system of equations (3)—(5) can be resolved for #, ’, and 7. The theorems of 
Chap. 4 then show that (3)—(5) has a unique solution with the initial values (6). 
(This has been established at least locally in a neighbourhood of ¢=0, but will 
turn out to hold also globally for all ¢-values.) This then is the desired trajectory. 
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12.4.2. Plane Orbits, Kepler’s Second Law 


Theorem Ll. 77 r(¢) and @(é) are a solution of 


- , Min n : 
0=mrg¢?— sa mt, r(0)=79, 7(0)=0, (1) 
a : 
O=m 5 (74), 90) =0, GO)=0, (2) 
then r(t), ¢(t), Wes are the solution, unique in a neighbourhood of t=0, of 


(12.4.1/3)—(12.4.1/6). 


= into (12.4.1/3)— 
(12.4.1/6) gives the above system of equations. The rest follows from the uniqueness 
theorem. The problem is thus reduced to (1), (2). 


Remark 1. The theorem is astonishing and trivial: substitution of #(/) = 


Theorem 2. The orbit of a planet (or of another celestial body) Nes in a fixed plane, 
obeying Kepler's second law. 


Remark 2. The first part of the theorem follows from the fact that #(t)= Further, 


Tv 
i an a 20 
dF= 5 rrdp, and hence Sa, ae r@ (Fig. 12.9). From (2) it is now seen that F(t)=c= 


const. But this is the principle of conservation of areas. 


= 


ag rd 


dF Fig. 12.9 


12.4.3. Kepler’s First Law 


From Theorem 12.4.2/1 it follows that Pp=rgv=h+0 (with h+0 we only 
exclude the case that the planet falls into the sun, p(¢)=0). Here h is the con- 
stant in the law of areas, which is assumed to be known (observational data). 


: : 1 ; 
So ¢(é) is strictly monotone, and we can write r(t) =r(d(v)) =—— as a function 


d2o yM o(¢) 
of m. Eq. (12.4.2/1) then gives dg? = eg ae Taking into account the initial 
values, one obtains the solution 
YM tye 
y= —— + (-——- COS Dp. 
(?) pe (- i " 
Hence 
Pp he 6 yl 
)) = ———_—_ wher — nd —=——~—. iL 
(9) 1+ecos@ a ~M a p rm (1) 


Here it is possible to determine p and ¢ from astronomical data. 


Theorem (Kepler's first law). The orbit of a celestial body as deseribed by (1) ts « 
curve of second degree (that lies in a fixed plane). The position of the sun vs in one 
of the focuses. The curve of second degree 1s 
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an ellipse (planetary orbit) for |e|<1 (Fig. 12.10), 
a parabola for e=1, 
a hyperbola (cometary orbit) for e>1 (Fig. 12.11). 


0 
SUT 


Fig. 12.10 NS ate eat 


Remark. The theorem is proved by returning to x,y-co-ordinates in the above formula (ith. 
According to our convention that @ =0 defines the orbital point nearest to the sun, we may 


' : P : se 
assume that 0 se. In the case of the ellipse one obtains a =F for the major semi-axis. 


e2 
12.4.4. Kepler’s Third Law 


Theorem. Let a be the major semi-axis of the ellipse of a planetary orbit, and let 
} T2 4x2 
T be the period of revolution of that planet, then —-=—_. 
Ce SB! 
sip 
Remark 1. From the law of areas it follows that F =—. If b is the minor senu-axis of the 
ellipse, then rab is its area. T can be calculated from ~ 
£ 


b — (aa 
Ta a Fine ol % 
0 


~) 


a 
With the use of 8 “V5 h, one obtains the assertion of the theorem. 
9 


Remark 2. Since a does not depend on the particular planet, this leads to Kepler's 
third law. : 


13. Measure Theory 


Hie Classes of Sets 


13.1.1. Algebras and o-Algebras 


In Chap. 9 we considered the Riemann integral in R,. At many points there 
an intuitive argumentation was necessary. It is now the aim of Chaps. 13 and 14 
to present a substantial generalization of the Riemann integral. The approach 
will be abstract, and will do without intuitive aids. 


MEA. 13.1. Classes of Sets 107 
EE ae 


Sets. We recall some concepts of set theory (Fig. 13.1). Let X be an arbitrary 
(abstract ).set. A set E with Ec X is called a subset of X. If 8c X and FCX, then 
one defines 
eminomeaWitiliece or «rely. 
the intersection EQN F={r|xe€E and r€F}, 
_ the complement £’ = X\E={x|xr¢éE}, and 
the difference E\F={r|xeE and x¢F}=ENF’. 


We make the convention that 0c X (0 being the empty set). 


ENF 


V . 5 
OE Vig. 13.1 ee ee eee Fig. 13.2 


Definition. Let X be an arbitrary non-empty set. Let U be a non-empty class (or set) 
of subsets of X. 
(a) WU 2s called an algebra if it has the following properties: 
if FEY, then E’ EY, and 
if EeMand FEM, then EUF EY. 
(b) M zs called a o-algebra if it has the following properties: 


af Ee, then E’ €U, and e 
W se Nvorj—1, 2, 3,...., then U Be. 
ja 


Remark 1. The elass P(X) of all subsets of a given set X (power set of X) is an algebra as 
well as a o-algebra. 


Remark 2. Let X= AR, (Fig. 13.2). Let a subset E of R, belong to the class Y if and only if 
£ is a finite union of half-open intervals [a, b). Here the intervals (—~, 6) and [a, ~) as 


ie 1 
well as £; and § are admitted to competition. % is an algebra. But (0, 1) = Ee 1} shows 
that 2% is not a o-algebra. n=2 L% 


Theorem. (a) Every o-algebra is an algebra. 
(b) Let Ube an algebra. Then VEX and XEA. Further it follows from E;€A for 
Nee I Pe Tha 
N 


wv 
U FU and 1 £;eN (N=1, 2,3, vee) 


j=1 p= 
(ce) If Mis a o-algebra, then it follows from H;EY for 7=1, 2, 3, ... that U H;EYA. 
foil 
Remark 8. Consequently, algebras are closed under complementation and under the for- 


mation of finite intersections and finite unions. g-algebras are also closed under the for- 
mation of countably infinite intersections and unions. 


13.1.2. Extension Theorems 


Let 9 and % be non-empty classes of subsets of the given set X, then BY 
means that every set of Y& also belongs to ¥. For a given 9, we ask for the 
smallest class § that includes Y and is an algebra or a o-algebra. In other words, 
if€ is an algebra or a o-algebra such that © DY, then © > Y should also be true. 
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Theorem. Let Y be a non-empty class of subsets of X. 
(a) There exists a smallest algebra that includes U. It is denoted by R(A). 
(b) There exists a smallest o-algebra that includes X. It is denoted by S(A). 


Remark. There arises the question as to whether R() and S(%) can be constructed. For 
N 

A(M) there is the following possibility. Let % = and Mit =|# | £’e%, or H=U E, with 
jel 

[Be af AH, Hy Ys ceac 


Lemma. RY) = U W,. 
k=0 


13.1.3. Borel Sets in R,, 


Let us extend the example of Remark 13.1.1/2 and generalize it to the n-dimen- 
sional case. Let X= Ry, and let Q={x|x€ Ry, aj< Die Ge) alent ane 
so-called n-dimensional open interval. As usual, we set x= (Ula ean) 


Definition. 7/9 ts the class of n-dimensional open intervals in Ry, then the elements 
of S(O) are called Borel sets. 


Remark 1. The Borel sets are said to be generated by the n-dimensional open intervals. 
With the aid of Lemma 13.1.2, it is not so difficult to imagine the sets of R(Y), whereas 
general Borel sets have a more complicated structure. 


Theorem. The Borel sets form the smallest o-algebra that includes one of the 
following classes: 

(a) the class of all n-dimensional open intervals {x | aj; =< 2j3< 63}, 

(b) the class of all n-dimensional closed intervals {x | Qj = x; = d;}, 

(¢) the class of all n-dimensional half-open intervals {x | 03S xj<b;}, 

(d) the class of all open sets in Ry, 

(e) the class of all closed sets in Ry. 


Remark 2. Tn particular, the set {x} consisting of the point x¢ R, alone is a Borel set. 


Consequently, U {a;} is also a Borel set. Hence, for n=1, {a |x rational} is a Borel set. Then 
j=l 
the complement {x |x irrational} is also a Borel set in Ry. 


13.2. Elementary Measures and Measures 
13.2.1. Definitions 


Let X be an arbitrary set, and let 9{ be an algebra of subsets of X. We consider 
non-negative set functions j defined on %: to every HEY there is assigned a 
number u(#) such that O=p(F)S-<. Here it is admissible that UH) = 0, As 
regards calculation with the symbol <, the following relations are adopted: for 
any real number a, one has a 4+ c= 0+, 


Definition. (a) A non-negative set function pw on an algebra YI is called an elemen- 
tary measure if at has the following properties. 
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pat BEX, ae 23, ,00@ (z= 0 fa ek, and af U H;EM, then 
oo j=1 
lt =o) u(E;) (o-additivity, or countable additivity). 
ae 
2. hc =e 
3. There exists a sequence aye sets E;EM, with ByoRycH3c, .., and u(y) <0 


(Or j= 1a, 3) SNC. nat N= U Li; (o-finiteness). 
j=l 


(b) An elementary measure that is defined on a o-algebra 7s called a measure. 


Remark 1. If %& is a c-algebra, then U E,;€Wfor £;€U. For an algebra W, U E;€ had to 
=! =I 


be postulated explicitly in the above definition. Frequently o-fmiteness is not included in 
the definition of the elementary measure. For our purposes, however, the above formulation 
will be sufficient. 


Remark 2. An elementary measure y is called finite if u(V) <. 
Remark 3. If 2{ is the o-algebra of the Borel sets in R,, then 


WB)={) ie ae EEX, is a finite measure. 
Tt is called Dirac’s measure. Let ay, .... a, be fixed points in Ry, and let u(£), with FEY. be 
the number of those points @, ..., a, which lie in Z. Then «(#) is a measure. 


13.2.2. Properties 


Since measures are also elementary measures, it will suffice to formulate 
propositions for elementary measures. 


(a) If E;€U for j=1,...,N,and 1f Hj; Ep=9 for j+k, then u 
(additivity). 

(b) If FEA, FEU, and ECF, then p(E)= pF) (monotonicity). 

(e) iB es a oe or countably infinite number of E;@M and for FEU, Ec UE;, 
then w(F)=s Dul#;) (subadditivity). 

(d) ie oe a eh, or countably oats number of E;€U with LE; ky, =0 for 
j+k and for FEU, UE; CH, then > w( Ej) Su(4). 


(e) If EjEU, with EycoRgcEyc ..., and if U Be, then 
ja 


SE 


Theorem 1. Let u be an elementary measure on the algebra YU. y 7) = 
(,U j=l 


= il 


o( U By) =I =lim u(H)) . 
k=1 

(f) If Bye, with By, DE yDEz> ..., and uf Q E; eM and if there exists a jy such 
that os. )< co, then al 


a ¢ a By) =I =lim p(B; « 
k 


il 
Remark 1. The proof is eee simple. The condition that «(#;,)<- for a suitable jg in 
part (f) of the theorem cannot be dispensed with. (e) and (f) are continuity properties. 


Remark 2. If Ec, FEW, EDF and p(E)<-~, then the following subtraction property 
holds: «(A\F) =(£) —"(F). 
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Remark 3. If Wis an algebra, then a non-negative set function ~ on is said to be finite if 
((£) << for all HEY. A non-negative set function « on Y is said to be additive if H(G) =0 
and (BUF) =(E)+(F) for all FEW and FEM with EA F =O. 


Theorem 2. Let u be a non-negative, finite and additive set function on the algebra MX. 


(a) uw ts an elementary measure tf u( U x) =lim u(4)) for all BE; eX with OB Ee 
Made Dies (0 5a) Day ene j=t jaa =) 

(b) uw ts an elementary measure if u(E;)}0 for all E;€% with HDs 2. and 
(i t5=0: 
psd 


Remark 4, Consequently a non-negative, finite and additive set function is an elementary 
measure if and only if either of the two continuity properties (e) or (f) of Theorem 1 holds. 


13.2.3. Heine-Borel’s Theorem 


This is not a theorem of measure theory but one that has been omitted in the 
chapters on the differential and integral calculus. 


Theorem. Let E be a closed, bounded set in Ry, and let F 4, Fo, Fs, ... be open sets 
a N 
in Ry, such that ECU Fy, then there exists a natural number N such that E eu Fr; 


(Fig. 13.3). t= j= 


Fig. 13.3 


Remark. In other words, if E is covered by a (countably) infinite number of open sets, 
then finitely many of these sets suffice to cover HE. 


13.2.4. Elementary Borel Measures in R, 


It is our aim to characterize measures on the Borel sets in R,. Let Y be the 
algebra referred to in Remark 13.1.1/2, then it follows from Theorem 13.1.3 
that the Borel sets form the smallest o-algebra S(Q) that includes 9. We call 9 
the algebra of the pre-Borelian sets. First of all, let us describe all elementary 
measures on Y{. We consider increasing, left-continuous real-valued functions 
p(t) on By, i.e., p(éo) =e p(t) (Hig. 13.4). Here it is allowed that y(t) be constant 


on intervals (i.e., not strictly monotone). If 
plc) =m 9%), o(—=)= lim it) , 


then the values 9(:7)=co and g{—) =—oo are permissible (but (cc) and/or 
y(—s) may also be finite). For the calculation with o we adopt the usual 
relations: @ + co= <0, q —co= —co for any reala, — ( — cc) = o0, 
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Fig. 13.4 Se icw 
a by a, by Qs 
Theorem 1. Let a,<b,<a,<b)=< ... <ay<by (where a,= —- and/or by=— ts 


n 
riggs and let E= U [aj, bj) (Fig. 13.5). If @(0) ts the function described above, 
then Tes! 

n 


K(E)=> (9b) —e(a))) where p(0)=0 (1) 


j=i 
7s un elementary measure on the pre-Borelian sets. 


Remark 1. The above theorem is plausible. In the case a;= —- one must of course sub- 
stitute (— oe, b,) for [a;, 6). The proof of a-additivity is, however, somewhat tedious. 


Remark 2. If ¢ is a real number, then g(t) +c generates the samc elemantary measure ; 
as g(t) does. Hence g(t) can be normalized by (0) =0. 


Theorem 2. [7 « 7s an elementary measure on the algebra of pre-Borelian sets, then 
there exists a real-valued, increasing, left-continuous function y(t) that generates pu 
in the sense of Theorem 1. y(t) is unique apart from an additive constant (Fig. 13.6). 


Fig. 13.6 


S 


Remark 38. The function 

LLCO, €)) for t>0 

0 for t=0 (2) 
—([t,0)) for t<0 


p(t) = 


is the desired generating function. 


Remark 4. The two theorems (and the construction formulas (1) and (2)) provide a one-to-one 
correspondence between increasing functions on A, and the elementary measures on the 
pre-Borelian sets in A. 


13.2.5. Elementary Lebesgue Measure in R, 


Setting g(t)=¢ in (13.2.4/1), one obtains the so-called elementary Lebesgue 
measure on the algebra of the pre-Borelian sets in Ry. One has u([a, b)) =b—a. 
We shall now generalize this elementary measure to the n-dimensional case. 
The algebra of pre-Borelian sets in #, is constructed as follows. Let 


C= een e=e-—0; for 7H ,...., 7} (1) 


be an n-dimensional half-open interval. Here —coSaj<o and —o=<bjsem. 
If aj= —, then a;Sa2;<b; must be read as —o=<.1j;<0j.) 
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N . . . S . 
Lemma. U%={8 | H=U Qz, Qy being an n-dimensional interval as defined in 
fal 


(1)} ts an algebra of subsets of Ry (the algebra of pre-Borelian sets in Ry (Fig. 13.7)). 


Fig. 13.7 Fig. 13.8 


Remark 1. N is any natural number. 6€% (to show this, one may choose a;=0; in (1); 
otherwise one naturally assumes that always a;<6;). Hence a pre-Borelian set H is a finite 
union of (possibly unbounded) n-dimensional half-open intervals. The “left” and the “lower” 
ends of # belong to #, whereas the “right” and the “upper” ones do not. For n=1 one 
obtains the previously defined algebra of pre-Borelian sets. 


Remark 2. According to Subsec. 13.1.3., the class of Borel sets is the smallest o-algebra 
S(M) that includes the algebra of pre-Borelian sets. 


Remark 3. We generalize the elementary Lebesgue measure in ft, as follows. If H=@ has 
n 
the form (1), then let «(Q)= J] (6; —a,) (here we assume that b;>a;,@ > co=o for a>0). If 
esl 
HE is an arbitrary set, then there exists a representation (not a unique one) 
iV 
E=U QO, 9,9Q;=0 for kl, (2) 
ail 


where the Q;’s have the form (1) (Fig. 13.8). We set 
iV 


WH) = 2 p(Qe) « 
ial 
The question now is whether p(Z) is independent of the kind of representation. 


Theorem. (a) «(2) is independent of the kind of the representation (2). 
(b) u(E) (with u(0)=0) vs an elementary measure on the algebra of pre- Borelian 
sets. (Lt is called elementary Lebesgue measure.) 


13.3. The Outer Measure, Extension of Klementary Measures 
13.3.1. The Outer Measure 


The aim of the following considerations is to construct measures on o-algebras. 
Here the o-algebra of the Borel sets in Fy, is of special interest. For the time 
being, the constructions of Subsecs. 13.2.4. and 13.2.5. only provide elementary 
measures on the algebra of the pre-Borelian sets in ,. Now the question arises 
whether these elementary measures can be extended in such a way that they 
become measures on the Borel sets. This will be realized in several steps, where 
we consider the abstract case. 


Definition. Let be an algebra of subsets of the set X, and let uw be an elementary 
measure on UX. If E is any set of X, then 


w*(E) =inf > u(it;), Hew i;, E,eN, (1) 
je 


j= 
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ws the outer measure (corresponding to uw). The infimum ts taken over all systems 
{E}ez,cCU wih ECU #;. 

j=1 
Remark. In contrast to ”, «* is defined on every subsct of XV. 


Theorem. The outer measure p* has the following properties. 
(a) 0=27( 2) Sc, p*(S—0. 
(b) w*(E)su*(F) for ECE (monotonicity). 
*(E | 


e*(E)s > plb,;) for ECU E; (subadditivity) 

j=l pei © 
) There exists a sequence E,c EycEy¢ ... such that U Fy= Xand w*(Ey) <0 
‘= 1, 2, 3, ... (a-finiteness). ja 


(e 
( 


) 
(d 
for k 


13.3.2. The Induced Measure 


We have the same situation as in Subsec. 13.3.1.: an algebra Y of subsets of 
the set X, an elementary measure y on Y, and a corresponding outer measure p*. 


Definition 1. A set E, ECX, zs called w*-measurable if, for all subsets A of X, 
w*(A)=n* (ANE) +u*(AN EB’). (1) 

Remark 1. It is our aim to construct measures. Measures have the property (1). This 

suggests to pick out those subsets which have the property (1) from the class of all sub- 

sets of E. 

Definition 2. A measure pu on a o-algebra € is called complete if the conditions 

ECG, u(E)=0 and FcE imply that F c&. 


Remark 2. Then it follows from (fF) S4(Z) that pif) =0. 


Theorem. (a) The class $ of all w*-measurable sets 1s a o-algebra. 

(b) If w*(E)=0 for EX, then E belongs to &. 

(c) The restriction fa(E)=u*(E) of p* to the sets FE € BS induces a complete measure 
i ona. 


Remark 3. 77 is called the measure induced by sc on B. 


13.3.3. The Extension Theorem 


What has been attained so far can be described as follows. The starting point 
is [X,%, w]: a set X, an algebra YA, and an elementary measure « on Y. In 
Subsec. 13.3.1. we have constructed the corresponding outer measure ~* on the 
class P(X) of all subsets of X:[X, P(X), u*]. By restriction of this measure 
to the o-algebra %S of the u*-measurable sets we obtained the induced measure 
fi on % in Subsec. 13.3.2: [X, B, @]. What is now of interest is the relationship 
between B and 9 and between fi and p. As usual, S(%) is the smallest o-algebra 
over Y. 

Theorem. (a) BDS(*). 

(b) The restriction of fi to S(Q) is an extension of 4, 0.€., if HEM, then f(L) = uF). 

(e) @ is a complete extension of @ on B, ie. @ is an extension of wim the sense 
of (b), and ji is a complete measure on %. 
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Remark. So we have reached the goal set out at the beginning of Subsec. 13.3.1. What 
remains is the question for the uniqueness of these extensions. We shall discuss this point 
in Subsec. 13.3.5. 


13.3.4. Borel, Lebesgue and Dirac Measures 


In Subsec. 13.2.4. we considered elementary Borel measures on the algebra 9 
of the pre-Borelian sets in R,. In Subsee. 13.2.5. we introduced the elementary 
Lebesgue measure on the algebra YY of the pre-Borelian sets in Ry. 


Definition. (a) A Borel measure in R, ts the extension (in the sense of Theorem 
13.3.3(b)) of an elementary Borel measure from the algebra X of the pre-Borelian 
sets in R, to the o-algebra S(X0) of the Borel sets in Rj. 

(b) The Lebesgue measure is the complete extension of the elementary Lebesque 
measure from the algebra % of the pre-Borelian sets in Ry to the o-algebra & in the 
sense of Theorem 13.3.3(c). 


Remark 1. According to Subsee, 13.3.2., B consists of the »*-measurable sets, where fois 
the elementary Lebesgue measure on the pre-Borelian sets & in R,. The sets of B are 
called Lebesgue-nieasurable sets, the sets of S(%) (i.e. the Borel sets) are also called Borel- 
measurable sets. 


Remark 2. LX, ©, v] is called a measure space if X is an arbitrary set, © is a o-algebra of 
subsets of XV, and y isa measure on ©. Aecording to the above definition we have two exam- 
ples: [#, o-algebra of the Borel-measurable sets in R,, Borel measure]. and [ Rp, g-algebra 
of the Lebesgue-measurable sets in R,, Lebesgue measure]. 


Remark 8. Consider an increasing step function g(t) on Ry, which has a finite or countably 
infinite number of jump diseontinuities i; (Fig. 13.9). The magnitude of the diseontinuity 
is given by 


aie p(t) — p(t;) 


where it has been assumed that (f) is again left-continuous). If E is a pre-Borelian set, then 


ME) = 2 oj (1) 
tek 


(where #4= > is not excluded) is the value of the eorresponding elementary Borel measure 
(Theorem 13.2.4/1). Here the sum is to be taken over those o;-values for which ¢; belongs to £. 
The corresponding Borel measure jf, as per Definition (a), is ealled Dirac measure. The for- 
mula (i) (with 7% instead of «) holds for arbitrary Borel-measurable sets. In particular, 
E({is}) =a, and g(R,) = > g;. If none of the jump discontinuities t; belongs to the Borel- 
incasurable set H, then #(£)=0, One says that the “mass” of the Dirae measure f@ concen- 
trates at the points ¢;. 


See 
i ; 
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13.3.5. Uniqueness Theorems 


We consider now the problem of the uniqueness of the extensions @ referred 
comm Eheoreny 1333.3. 


Theorem 1. The extended measure fi of Theorem 13.3.3(b) is unique: if » ts a 
measure on S(QM) such that u(E)=9(k) for HEM, then » =ji. 


Remark 1. In particular, the Borel measures are thus uniquely determined by the clemen- 
tary Berel measures. Every measure on the Borel-measurable sets is a Borel measure in the 
sense of Definition 13.3.4. 

Theorem 2. /f [X,©, e] vs a measure space, then there exists a smallest complete 
extension LX, ©, 6]. 

Remark 2. [.X.€. 9] is a measure space with © DU and 6(#)=o0(#) for £ EC. Further, 6 is a 
complete measure. This is the definition of a complete extension. The term “smallest com- 
plete extension” means: if [X, 8, ¥] is a complete extension of LX. G, g]. then LX, 8. 7] is 


also an extension of [.X, ©, 9]. 


Remark 3. It can be shown that LX, ©, #] of Theorem 13.3.3(c) is the smallest complete 
extension of [.V, S(), @] of Theorem 13.3.3(b). This also includes a uniqueness theorem for 
the Lebesgue measure of Def. 13.3.4: it is the smallest complete measure that coincides 
with the elementary Lebesgue measure on the pre-Borelian sets in f,. 


bot Measurable Functions 


13.4.1. Definition 


As stated above, a measure space [X, 8, u] consists of a set X, a o-algebra B 
of subsets of X, and a measure uw on %. We consider real-valued functions f(x) 
on X. If M is a subset of Ay, then 


fOUW) = fr | ce X, f(xz)eM} 
is called the inverse image of M (Fig. 13.10). 
F(x) 


f7(M) Fig. 13.10 


Definition. f(x) is called measurable (or a measurable function) if f-(M)E% for 
every Borel set M in Ry. 


Remark 1. Consequently the measurability of a function only depends on %, not on #1. 


Remark 2. If X=R,, then there are two important o-algcbras: Bp, the o-algebra of the 
Borel-measurable sets, and By, the o-algebra of the Lebesgue-measurable sets; cf. Subsce. 
13.3.4. Since Bp CBz, every Borel-measurable function (ie... 8=Bpy in the above defi- 
nition) is also Lebesgue-measurable (i.c., B= yz in the above definition). 
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Lemma, Let [X, 3, u] be a measure space. A function f(x) is measurable if and 
only if thé inverse image f—{{ —<, c)) belongs to B for every real number c. 


Remark 8. (— <=, c) is a Borel set in Ry. Hence f~!((—-, c))€% if f(z) is measurable. The 
lemma says that it suffices to check whether or not the inverse images of the special Borel 
sets (— >, c) belong to %. 
Remark +4. Let _X, %, s] be a measure space. Jf f(x) is measurable, then 

{x | eX, asf) = HCD (1) 
for — » Sc, =Cy =~. Here <, means that both <and & is possible in the above formula. So (1) 
includes four cases. If ¢,= — and/or cg=-, then one has to read <, as <. As a special case 


it is seen from (1) that /~1({c}) = {a | f(x) =c} is measurable for every real number c. 


Remark 5. Sometimes it is practical to admit f(z) = © and f(a) = —-. The above definition 
must then be supplemented by {x | f(z) =-0}€% and {x | flv) = — oe. 


13.4.2. Properties of Measurable Functions 


ee sie 1. Let es B, uw] be a measure space, and let f(x) be a measurable function. 
(a) If(x)|, f+ (vw) = max (f(x), 0) (positive part of f(r)) and f-(#) = —min (f(x), 0) 
(negative part 7 {(x)) are measurable. 


1 
(F)c = Fig for fa), 
Oo aa) 08 


then (FJoa vs measurable. 
(¢) For any real number A, (Af)(x) =Af(x) ts measurable. 
(d) Jf f(x) =0 and «=0, then f*(x) ts measurable. 
Remark. Consequently one has f=/+ —f- and |f|=/+ +f-. 


Theorem 2. Let [X, B, «] be a measure space, and let f(x) and g(x) be measurable 
functions. 
(a) For any real number c, 


{a | f(x)<g(x) +c}EB, {x | fx) Sg(x) +e}€ B, 
{x | f(x)=g(x) tc} EB - 


(b) (f+9)(x), max (f(x), g(x)), min (f(x), g(x), and (f+ g)(x) are measurable. 
(ce) ff, [Se ee 

(A) = a (Gr gia =04 
9 OE ee alga 10) 


then fale 7s measurable. 
g 


13.4.3. Sequences of Measurable Functions 


If [X, B, «] is a measure space, and if {/,(x)}j7_, is a sequence of measurable 
functions that converges for every x€ X, then one may ask whether /(x) =lim /;(x) 
Y eaticasd 


is also a measurable function. It is practical to modify this question. A subset 


erase 13.4, Measurable Functions LI 
eer as aE 


FE’ of X is called a set of measure 0 if H is measurable (i.e., if it belongs to 8) 
and p(h)=0. 
Convention. A property (e.g., the convergence of a sequence) is said to hold 


almost everywhere in X (abbreviated: a.e.) if this property holds for all «¢X 
except for a set of measure zero. : 


Remark 1. Since @ is also a set of measure zero, the convention “a.c.”’ docs not exclude that 
the property holds for all ~¢X. 


Examples. The statement that fim f(a) exists a.e. means that there is a set 

Ee with u(E£)=0, such that lim f(x) exists for all x€X\E. |f(xz)laoo ae. 
james 

means that there is a set HES with w(£)=0, such that |f(x)|<- for xe X\E. 


Theorem 1. Let [X, 8, u] be a measure space, and let {f;(x)}4 be a sequence of 
measurable functions. 
(a) Lf fo(x) =sup f;(7) << a.e., then 
i) cy 


a ae 


7s measurable. 
(b) If fo(x) = inf f;(7) > —- a.e., then 


_ffolz) for folx>—-—@, 
Ko {0 for fo(x)= —© 
7s measurable. —__ 
(e) If fo(x) al fi(x) and |fo(x)| <0 a.e., then 


my ifoe) for Ifu(x)| << > 
= {0 or Uoa@)lae 
is measurable. 
(d) Zf fo(x) = lim f;(7) and lfo(x)| < <> a.e., then 
j 


_ ffolz) for Ifo(x)l<e 

K-10 for ligeslam 
zs measurable. 
Remark 2. lim (upper limit) and lim (lower limit) have been explained in Subsec. 5.4.1. 
Tf lim f,(z) exists a.e., then lim =lim =lim a.e. If # is the exceptional set with j(#)=90, 

jre 

then it follows from (c) and (d) that 
is f(x) for xeX\H 


jrco 


Ma ee NeID 


is measurable. 


Definition. Let [X, B, uv] be a measure space. f(r) ts called step function vf 


z 1 for xéK; Bert 
fle) => mzle) where z={y Tn Teg, Wie’. 


Here the x; are real numbers. 


Remark 3. 7; is called the characteristic function of the set H;. [t is immediatcly observed 
that a step function is measurable. 


10 ‘Triebel, Math. Physics 


118 13. Measure Theory 13.4.4, 


Theorem 2. /f[X, 8, u] 7 a@ measure space, and if f(x) is a measurable function, 
then there exists a sequence of step functions f(x) such that 


f(x) =lim fz) for eX. 


Remark 4. The sequence 


Ie) =\ap for ap Sh@< 9p (1) 
3 for f(x)<—j 
will serve the above purpose, with /= Se heen le eagle 
Remark 5. If /(x) =0, then (1) shows that f;(x) t /(x) 1s a monotonically increasing sequence 


of step functions (Fig. 13.11). 


<----1j 


fo Fig. 13.11 


13.4.4. Convergenee Almost Everywhere, Convergence in Measure 


Definition. Let [X, 8, u] be a measure space, and let {fi(x)}F1 be a sequence of 
measurable functions. 
(a) f; Te | (convergence a.e.) means that there exists a set K€ B with u( EF) =0, such 


that f;(x)> f(x) for all «€.X\H (pointwise convergence a.e.). 

(b) {fi(x)}f4 as called an a.e. Cauchy sequence (or a.e. fundamental sequence) 
if there exists a set HEB with u(L)=0, such that, for every e=0 and all xe X\E, 
there exists a natural number my=mo(e, xr) with 

fn(x) — fin(x)| =e 
for alln=m=2 mp. 


(e) Let f(z) be a measurable function. f;- > f (u-convergence or convergence in 
measure) means that, for every e>0, : 


lim a({e | [f(x)— f(a) 223) =0 


(d) {fi(x)}721 ts called a p-Oauchy sequence (or u-fundamental sequence) if for 
every e>Oand every 6=0 there exists a natural number my = mo(e, 6) such that 
w(x | lfalor) — fila) =0}) <0 (1) 


for all L=kZ=mpo. 


Remark 1. The following generalization of (c) and (d) is of interest. If 4€%, then tera 
(u-convergence in 4) means that, for every ¢=-0, ee 


Ly ({x | e€A, Ilo) —fy{a)| Be}) =0.. 


By analogy, for -Cauchy sequences in 4 one has to replace {a | ... in (1) by {v| EA... 
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Theorem 1. Let [X, B, uw] be a measure space, and let {fi(x)}j21 be a@ sequence of 
measurable functions. 


(a) fi => f implies ear =f for every set A EB with p(A)<e. 


(b) ie a.e. Cauchy ee is also a u-Cauchy sequence in A for every ACS 
with p(A)<o. 


(Oey oe f, then there exists a sequence of natural numbers j,<jy<jx< ... such 
that fj, an cs. 

(d) Tf {f)}ji1 isa e-Cauchy sequence, then there exists a subsequence (fi, te-1 which 
isan a.e. Cauchy sequence. 


Remark 2. If eal then it follows from Remark 13.4.3/2 that f(x) is measurable. Thus 


proposition (a) of the above theorem is meaningful. 


Remark 3. The propositions (a) and (b) cannot be extended to A=X if php(X)=-. It is 
easy to find counter-examples. 

Remark +4. It is also possible to construct {fi,(2)}e=t of proposition (c) so that there exists a 
sequence E, > Ey ae > ... of sets in 8 with (HZ) =27' and 


ae fj, —f(z)|\ +0 as koa 


TEX\E 


Oi ales ae .. In other words, for every ¢+0 there exists a set E,€ 8 with u(H,) Se, such 
that thle) \ converges uniformly to f(x) on X\H,. The exceptional set in proposition (ce) 


may be chosen to be, say, qn E}. 
t=1 


Remark 5. If «(X)<-~, then one may set A= X in (a) and (b). The convergence a.e. then 
implies j.-convergence. Generally the converse is not true, but for a suitable subsequence one 
has proposition (c). 


Theorem 2. Let [X, 8, u] be a measure space, and let {f;(x)}72, be a sequence of 
measurable functions. 
(a) fj nual implies that {fi(x)}j24 1s an a.e. Cauchy sequence. 


(b) fi =f implies that {fj(x)}74 ws @ p-Cauchy sequence. 


) 
(ce) If fila) }e 2, ts an a.e. Cauchy sequence, then there is a measurable function 
- such that : mel Li then f(x) =g(a) a.e. 


) If {h(x)i1 ts @ w-Cauchy sequence, then there is a measurable function 
At e:, that . ne LG eae then f(x) = g(x) ae. 


10* 
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14. Integration Theory 


14.1. Integrable Functions, Properties of Integrals 
14.1.1. Integrable Step Functions 


Definition. Let [X, B, pu] be a measure space. 
P N 
(eA sien junciion fn) = > axe (xt), where the a; are real, Hye, U B= X, 
jal j=1 
ie) =O for k= land Ze,(x) ws the characteristic function of E; (ef. Def. 13.4.3), 
as called integrable if (Hj)<- for aj+0. One sets 
ir 


f He) Iu B aj(Hi) (with 0-0). (1) 


J= 


(b) Lf f(x) ts an tntegrable step function, and if AES, then let 
J f(x) du=f f(x) za(x) du , 
where y4 18 the characteristic function of A. 


ae 
Remark 1. It is easily shown that(1)is independent of the representation: if /(#) => Bux, (2). 
k=14 


AL 
with real f,, F,EB, U Fe=X, Fy NF, =@ for k +4, is the step funetion referred to in (a), 
then k=1 

aM 


Z oplE y= 2, Bully) . 
eel k=1 


Remark 2. If /(z) is an integrable step function, and if A¢€%, then /(x) x 4(x) is also an inte- 
grable step function. This justifies part (b) of the above definition. 


Lenmma. Let [X, %, uv] be a measure space. 
(a) Lf f(x) and g(x) are integrable step functions, and uf «and B are real numbers, 
then af(x)+Bg(x) ts also an integrable step function, and 


f (f(x) + Bglx)) du =a f fe) dp +B f g(x) Ap. 


(b) If f(x) is an integrable step function, then |f(x)| is also an inteyrable step 
function, and 


If fix) dul =f lfx)l dy. 


(ce) Lf f(x) and g(x) are integrable step functions such that f(x)Sq(x), then 
J flr) du sf g(x) dy. 
(d) Zf f(x) ts an integrable step function, and if ACB, then 


Sfx) du=f f(x) du+ f f(x) dp. 
A A\A 


Remark 8. All the above properties are easily deduced from the definition. From (c) one 
obtains 


Sg(x) due =o, 


if g(x) 20 is an integrable step function. 


if 
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14.1.2. Integrable Funetions 


Definition. Let [X, 8B, nu] be a measure space, and let f(x) be « measurable function. 
f(x) as called integrable if there is a sequence {f;(x)}7, of integrable step functions 
with wel such that for all e+0 there exists a natural number kg=ko(e) with the 
property that 

SlA(2)—frlx)| du se forall l2h=hk,. (1) 
We set 


file) dpe =lim f f(x) dy. (2) 


Remark. The limit in (2) exists, beeause 
S fala) due —f fila) dpe =| f (te(x) — f(x) Aye 
=f \fx(x) —fr(x)| duse 
for l=k2=hp(e). It is more difficult to show that this limit does not depend on the sequenee 
Ge) oa it ees is a sequence of integrable step funetions, and if the analogue to (1) 
(with g instead of f) is true, then it follows that 
Vim fgj(x) du =lim. f(z) de. 


This justifies (2). 


14.1.3. Properties of Integrable Functions 


The properties stated in Lemma 14.1.1 can be transferred to integrable 
functions. 


Theorem Let [X, ¥, u] be a measure space. 
(a) If f(x) ts integrable, then |f(x)| is also integrable, and 


If f(x) dul =f Ifa) dy . 
(b) If f(x) and g(x) are integrable, and if « and B are real numbers, then af(x)+ 
+ Bg(x) ts also integrable, and 
f (afl) +Bg(2)) du =a f fle) due +B f glx) du. 
(c) If f(x) and g(x) are integrable, and cf f(x) =g(x), then 


Jf fe) du =f g(x) de. 
In particular, f g(z) du=0 7f g(x) 20 ws integrable. 
(d) Let f(a) be integrable. Then f |f(x)| du=0 if and only if f(x) =Oa-e. 
(e) Let f(x) be integrable, and let q(x) be measurable. If f(x) =q(x) a.e., then g(x) ts 
integrable, and f f(x) du=f g(x) du. 


Remark 1. The properties (a), (b) and (e) are also valid for Riemann integrals; cf. Theorem 
9.1.3/1 and Theorem 9.1.5/1. On the other hand, (cd) and (e) are typical new porperties. 

oe - 
Remark 2. (a) implies that f*(x) =max (f(x), 0) 5 (|f(x)| + f(x)) is integrable too. 
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14.1.4. Properties of Integrals 


Definition. Let [X, 8, uw] be a measure space, and let f(x) be an integrable function. 
If EC®, and tf yu(x) ts the characteristic function of E, then one sets 


J f(x) du=f f(x) zea) de . 
Remark 1. This is the generalization of Def. 14.1.1(b). If f(z) is integrable, and if HEY, 
then f(x) yp(x) is also integrable. So the above definition is meaningful. 


Theorem. Let [X, 8, «| be a measure space. 
(a) If f(x) ts cntegrable, and if HEB, then 


Sfx) du= f f(x) dut+ f fx) dp. 
E X\E 
(b) Let f(x) be integrable, and let f(x) +0 for re X. If f f(x) du=0, then (EF) = 0. 
ss BE 
(ce) If f(x) is integrable, and if X=U EB;, with H;EB and FE, H,=@ for k+l, 
then Hoe 


ffx) dp=D ff fx) dp 
K=1 Ly, 


(absolutely convergent serves). 


Remark 2. The following proposition can be deduced from (c). If f(z) is integrable, and if 


E,cE,cEH3c..., with X= U H; and £;€%, then 
jat 


lim f f(x) du=f f(x) dy. 
joo Ey 
Remark 3. Another implication of (c) reads as follows. If f(a) is integrable and /(~) =0. then 
WE)= fade, HED, 
E 


is a measure. 


14.2. Fundamental Theorems of Integration Theory 
14.2.1. Convergence in Ly, 


Detinition. Let (|X, 3, w\ bea measure space, and let f(x) and j(2), 7=1, 2, 3, 2, 
be integrable functions. 
(a) fj i f (convergence in Ly, or convergence in the mean). means that 
1 


lim f Ifi(2) fel du=0 
(b) (flx)}f4 ts called a fundamental sequence in Ly (or Cauchy sequence in 1) 
af for all e+0 there exists a natural number my = moe) such that 
flinlz)—fnlx)l duse forall n=m=mg. 
Lemma. Let [X, 3, uw] be a measure space. 
(a) ligt implies fj Fee 


(b) Lf {fj(x)}721 vs a fundamental sequence in Ly, then {fj(x)}7_, ts also a u-funda- 
mental sequence. 


Wb? 8. 14.2. Fundamental Theorems Oe 
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Remark. If «(X) <<, then according to Subsec. 13.4.4 one has the following situation; con- 
vergence a.e. implies “-convergence, and convergence in L, likewise implies -convergence. 
‘convergence of a sequence implies the convergence a.c. of a suitable subsequence. In 


particular, it is thus possible to sclect from a sequence convergent in L, a subsequence that 
is convergent a.e. (Fig. t4.1). 


u-convergence 


convergence a.e convergence in lL, 


Fig. 14.1 


14.2.2. Lebesgue’s Bounded Convergence Theorem 
Theorem. Let [X,%, «] be a measure space, let {f;(.r)}j24 be a sequence of integrable 
functions, and let g(x) be an integrable function such that 

=a) fom 71, 2,3).. aid eX . 
If evther f; 7 POV jf Or muh then f(x) vs tntegrable, and 

fi) du=lim f f;(x) a : (1) 
Remark. Formally, one may write (1) as 

fF (lim fyler)) delim f flor) dy 

je je 


Hence Lebesgue’s theorem expresses the commutativity of limit and integration. g(a) is 
called an integrable majorant. If such an integrable majorant does not exist, the theorem is 
no longer valid, as is shown by the following example (Fig. 14.2): in Ry, let f(x) =j for 


1 
= 7 and f;(z) =0 otherwise. Then f/,(z) de=1 for j7=1, 2, ... On the other hand, /; -0 


a.e. and f0 dv=0. This shows that Lebesgue’s theorem cannot be valid in this case. 


nate 
g 2 


14.2.3. Other Properties of Integrable Functions 


There are some simple conclusions that can be drawn from Lebesgue’s theorem 
of bounded convergence as well as from previous considerations. 


Theorem 1. Let [X, B, u] be a measure space, and let f(x) =0 be integrable. If the 
functions f;(xr) are as given in (13.4.3/1), then 


f fle) du=lim f file) de. (1) 
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Remark 1. The theorem is an immediate consequence of Theorem 14.2.2, if it is taken into 
account that 0=s/,(x)=/(x) and /;—~> /. Thus the integral over a non-negative integrable 
function can be represented as the limit of integrals over step functions. This reminds us of 
the Riemann integral. The methods are, however, different. Let, for instance, X = R,, and 
let 4 be the Lebesgue measure on FR. For the construction of approximating step functions, 
one subdivides the x-axis for the Riemann integral (cf. Subsec. 3.2.1.), whereas for the 
Lebesgue integral the y-axis is subdivided (Fig. 14.3). 


Riemann integral, Lebesque integral, 
longitudinal stripes cross—stripes 


Theorem 2. Let [X, B, uw] be @ measure space. f(x) is integrable 7f and only if 
lf(x)| 2s ¢ntegrable. 

Remark 2. There is no analogue to this theorem in Riemann’s theory of integration. Com- 
pare this with Theorem 3.2.2/2(b) and with the counter-example of Subsec. 3.2.4. The 
function considered there is Lebesgue-integrable. 


Theorem 3. Let [X, B, u] be a measure space. If f(x) 1s measurable and g(x) is 
integrable, and tf \f(x)|=lg(a)| for xe X, then f(x) is also integrable. 


14.2.4. The Banach Space L,(X, 8, p) 


If [X, B, w] is a measure space, then we introduce an equivalence relation 
in the set of all measurable functions: j~g if /(x)= g(a) a.e. It is easily seen that 
~ satisfies the requirements placed on an equivalence relation: {~ f (reflexive) ; 
f~g if and only if g~f (symmetric); f~g and g~h implies that /~ h (transitive). 
So the measurable functions can be subdivided into equivalence classes: let 
[/] be the equivalence class that contains f(x) as a representative. 


Definition. L;=L,(X, B, uw) ts the set of all equivalence classes of measurable 
functions for which 


Iflllz, =f lh(x)l du<eo where h(x)€[f]. (1) 


Remark 1. It is easily seen that (1) is independent from the particular selection of the repre- 
sentatives h(a) €[/]. Hence (1) is meaningful. 


Theorem. L, 7s « Banach space with the norm |\[-]J\z, - 
Remark 2. There is no corresponding theorem for Riemann integrals. This shows a decisive 
advantage of the present, more general concept. 


Remark (convention). In our future calculations we shall use representatives /€ L, instead 
of equivalence classes [//¢€L, We also write iIf\lz, instead of iI[7lllz,. In the sense of L,, two 
functions are equal if they coincide a.e. 


Remark 4. /; a j implies that f |/;(x)| dye—f tf(w)| dye. 
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14.2.5. The Theorems of B. Levi and Fatou 

Theorem 1 (B. Lev7). Let [X, 8, uv] be a measure space, and let Ce ie ‘ (a= 


f(x) = ... be a monotone ieee de of rntegrable functions. i eee ae (Bs 
Then f(x) =lim fi(x) ts integrable, and f f(x) du = lim f fi(x) d j 


e 


Remark 1. Tn particular the theorem says that f(x) converges almost everywhere. The theo- 
rem also remains valid for 2 JS ii(a) due=, provided that f f(z) dj=- is interpreted to ex- 


press that f(x) is seeGird ble but not integrable. (Here it is also possible that f(x)=- on a 
set of positive measure.) 


Theorem 2 (fatou’s lemma). Let [X,B, wu] be a measure space. Let {f;(x)}721 be a 
sequence of integrable functions, fj(x)=0 for j=1, 2,3,... If lim f f;(xr) du<-, 
then f(x) =lim f;(x) ts ¢ntegrable, and j 

j 


ffa) duslim f f(a) 4 
j 
Remark 2. lim =lim has been defined in Subsec. 5.4.1. 


j 


14.3. Transformation Formulas 


14.3.1. Measurable Mappings and Image Measures 


We will consider a measure space [X, B, vu] and a measurable space (Y,(@) 
consisting of a basic set Y and a o-algebra © of subsets of Y (Fig. 14.4). 


Definition. A single-valued mapping T(x) of X into Y ts said to be measurable if 
T-\(M)={x | xEX, T(r)eMyEB (1) 
for every set MEG. 


aif 


Fig. 14.4 


Remark 1. Consequently, the inverse image 7’~'(./) of every set VW of © belongs to B. H , 
Y = R,, and if Cis the c-algebra of the Borel sets in /;, then the above definition agrees with 
Def. 13.4.1: thus measurable functions are special measurable mappings. 


Theorem. Let [X,%8, yu] be a measure space, let (Y,) be a measurable space, 
and let T(x) be a measurable mapping of X into ¥ in the sense of the above defini- 
tion. Let v(M)=(T-1(M)) for MEG. If there exists a sequence Fy» cP,CKR3c 


such that F;€E, v(F;)<0 and Y a0 then v is a measure. 
j=l 
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Remark 2. Since 7'(x) is measurable, »({1/) is defined for every set J7 €€. To prove the above 
theorem, one has to check the properties stated in Def. 13.2.1: The o-additivity follows from 
the setup, and g-finiteness has been required by us. It is possible to state examples which 
show that o-finiteness cannot automatically be derived from the sctup for ». 

Remark 3. One writes y=7'~‘, calling » the image measure (with respect to w and 7). We 
make the convention that the term “image measure” shall imply the existence of a se- 
quence of sets FC FyC Fc... with the properties referred to in the above theorem. 


14.3.2. A Special Transformation Formula 


Consider a measure space [X, B, uJ], a measurable space (Y,C), and a mea- 
surable mapping 7'(x) of X into Y (Fig. 14.5). According to Remark 14.3.1/3, 
let »=yT! be the associated image measure. Then [Y, &, v] is a measure space. 
Let x be the general point in X, and y, the general point in Y. Functions f on X 
are written as f(x), and functions f on Y, as f(y). Measurability, integrability 
etc. are then always to be understood as being referred to the associated measure 
space. 


(x) x 
LX bpd 
T(x) 
(y) 
Eye ar 
i! Fig. 14.5 


Theorem. (a) [f f(y) 7s measurable, then h(x)=f(T'(x)) is also measurable. 
(b) Lf f(y) ts integrable, then h(x) =}f( T(x)) ts also integrable, and 


Shy) del Y=f f(P(a)) du. 


14.3.3. Absolutely Continuous Measures, Theorem of Radon-Nikodym 


Definition. Let LX, B, uw] be a measure space. A measure »v on B ts called abso- 
lutely continuous (with respect to uw) if u(E)=0, FEB, always implies that 
ro(E)=0. 
Remark 1. If v is absolutely continuous with respect to 4. one writes »y < «x. Examples can 
easily be stated. If f(z) =0 is integrable, then 

»(E) =f he) du, EE’, (1) 


is absolutely continuous with respect to yc; cf. Remark 14.1.4/3. 

Remark 2. If /(x)=0 is measurable but not integrable, we write f f(x) du=oo. Then Eq. (1) 
is meaningful for any measurable functions /(x) =0. If v is a measure (and hence, in particu- 
lar, is o-finite), then it is absolutely continuous with respect to wu. The following theorem is 
to be understood in this sense. 


Theorem (Radon-Nikodym). Let [X, 8, u] be a measure space. If v < uw, then there 
exists a measurable function f(x)=0 such that 


wW(H)=f f(x)du forall HEB. 
E 


dike! Mh. 14.f. Product Measures, Fubini’s Theorem lee 


If g(x)2=0 ws a measurable function such that (E)=f g(x) du for all HEM, then 
f(x) =g(x) almost everywhere. i 


dy 


Remark 3. So f(x) is in effect uniquely determined, and one writes f(#) = 


i : Radon-Nikodym 
derivative. : du 


14.3.4. The General Transformation Formula 


Consider two measure spaces [X, 8, «] and [¥,@€,e] and a measurable 
mapping T(x) of X into ¥. Let w7~! be the image measure with respect to u 
and T in the sense of Remark 14.3.1/3. As regards the designation by x or y, the 
conventions made in Subsec. 14.3.2. shall apply. 


d =| 
Theorem. Let uT-!< o. If f(y) ts measurable, and cf fly) — (y) is tntegrable, 
then h(x)=f(T(x)) ts tntegrable, and e 
: d(uT-! 
fre) du= [hoy Say ae 


Remark 1. This theorem is a combination of the theorems of Subsecs. 14.3.2. and 14.3.3. 


14.4. Product Measures, Fubini’s Theorem 
14.4.1. The o-Algebra in a Product Space, Measurable Interseetions 


Let 8 be a o-algebra of subsets of the set X, and let © be a o-algebra of 
subsets of the set }. Let 


XxY={(z,y)|reX, yeV}, ExF={(x, y) lech, yer}, 
where E£ is a subset of X and F is a subset of Y (Fig. 14.6). 


Definition. (a) BX us the smallest o-algebra of subsets of the set X X Y, contarning 
all sets of the form EXF, with E€® and F c& (product o-algebra). 

(b) If GCXXY, and r€X, then Gr= ty | ye Y, (x, y)€G} ts called an wnter- 
section of G. Accordingly, GY= {x | rE X, (a, y)EG}, with ye Y, ts called an inter- 
section of G (Fig. 14.7). 

(e) If f(x, y) ts @ real-valued function on X X Y, then fcely)=f(x, y) ts called an 
intersection of f, and f¥(x)=flx, y) ts culled an intersection of f. 


Remark 1. One has G,c Y and GYCX. Further /;(y) means that, with x€X being fixed, 
f(z. y) is considered a function of y, and analogously, /¥(z) is considered a function of 2, 
with y€ Y being fixed. 


Theorem. (a) If G€B xG, then Gz€€ for all xe X, and GYE® for all ye Y. 
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(b) Lf f(x, y) ts measurable with respect to 8x, then f(y) is measurable with 
respect to for all xe X, and f"(x) is measurable with respect to B for all ye Y. 


Remark 2. Intersections of measurable sets are also measurable sets, and intersections of 
nieasurable functions are also measurable functions. 


14.4.2. The Produet Measure 


Definition. Let [X, B, u] and [Y,G, v] be two measure spaces. A measure 4 on the 
product o-algebra BX ws called product measure if EX F)=p(E)+v(F) for 
all EEX and all FCG. 


Remark 1. ;(£) and/or »(f) may be «. We recall the previous convention that 0 - 0o=0 
and @ + «=e for a>0. Further we set f (x) dj = (as we did in Remark 14.3.3/2) if f(z) =0 
is measurable but not integrable. Here it is expedient for the formulation of the following 
theorem that /(2) = is allowed on an arbitrary measurable set; cf. Remark 13.4.1/5. If the 
set with /(x)=c has a positive measure, then f f(a) dj =. 


Theorem. In the sense of the above definition, there exists exactly one product 
measure. Let 2 be such product measure, and let GES XE, then 

AG) =f 1G) du =f p(G") dv . | (1) 
Remark 2. According to Remark 1, it is possible that A(G) = 0, The expressions fornulated 
in (1) are meaningful. G.,¢€ by Theorem 14.4.1, and hence one may form »(G,), considering 
it a function of x. It has to be shown that »(@,) is measurable with respect to 8. Since 
(Gz) =0, the first integral in (1) is then meaningful in view of Remark 1. A corresponding 


statement can be made for the second integral. The product measure is also written as 
=D. 


14.4.3. Fubini’s Theorem for non-negative Funetions 


Theorem. Let [X, %, u] and [Y,6, v] be two measure spaces. Let [XX Y,B~x6G, 
UX] be the associated product measure space. If f(x, y) = 90 ts a function measurable 
with respect to 8 x, then f f(x, y) du is a -measurable function and f Hx, y) de ts 
a B-measurable function. Further, 

f flrs) Muxr)=f (fF fas y) dp) dv=f (f flr, y) dv) Ae. (1) 
Remark 1, In accordance with Remark 14.4.2/1, the integrals encountered may also be 
infinite. In any case, f f(x, y) du=f f¥(x) dy as well as JS i(x, y) dv=f f(y) dv are meaningful. 
The first assertion of the above theorem states that J /¥(x) du is measurable with respect to 
the o-algebra ©, and an analogous statement is made on Sixty) dv. As these functions are 
non-negative, all of the integrals in (1) are meaningful. (1) is meant to include the following 


assertion: if one of the three integrals is finite, then the other two integrals are finite too, 
and all three integrals are then equal. 


14.4.4. Fubini’s Theorem for Arbitrary Funetions 


Theorem. Let [X,B, uw] and [Y,G, v] be two measure spaces. Let [Xx Y, Bx, 
UX] be the associated product measure space. Let f(x, y) be uw x v-integrable. Then 
fly) ts a v-integrable function for almost all x€X, and {"(x) is a p-tntegrable 
function for almost all ye Y, 


J fuly) dv rs w-rntegrable, f f!(x) du is v-integrable, (1) 
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and 

few Auex=s(f fry) dp) dr=s (f fle, y) dr) ae. (2) 
Remark 1. Consequently /,(y) belongs to L,(¥, ©, v) for all ee X\h, with LES and «(£)=0. 
For these x-values the first integral in (1) exists. For a¢€K one may set, say, f fr(y) dv=0. 
Analogous statements can be made on fY(2). 
Remark 2. This theorem does not include Theorem 14.4.3 as a special case. Theorem 14.4.3 


has an independent meaning, because (unlike the above theorem) it is applicable to arbitrary 
non-negative measurable functions. 


Riemann’s integral is a special case of Lebesgue’s integral 
(mathematical folklore) 


11.5. | Comparison between Riemann’s and Lebesgue’s Integrals 


14.5.1. Integrable Functions 


In Def. 9.1.2(b) it had been stated when a real-valued bounded function He) 
defined on a Q-domain 2 in R, is Riemann-integrable. We shall now denote the 
corresponding Riemann integral by Rf f(x) dx. If @ is an J-domain, the Rie- 
mann integral for functions being continuous on 2 had been introduced by 
Def. 9.1.5. We shall likewise denote it by Rf f(a) dx. In accordance with Subsec. 
13.3.4., let [#,, Br, wp] be the measure space consisting of the basic set X = PL, 
the o-algebra By, of the Lebesgue-measurable sets in R,, and the Lebesgue meas- 
ure z,. We shall now denote the corresponding Lebesgue integral in the sense 
of Def. 14.1.2. by Lf f(x) dz, where we write dz instead of dy. If QEB,, we 
set f f(r) dur =Lf f(x) dx, in accordance with Def. 14.1.4. 

ia) 


Theorem. (a) Every I-domain (and hence also every Q-domain) belongs to B. 
(b) Tf f(x) is Riemann-integrable over the Q-domain Q, then f(x) is Lebesque- 
trtegrable, and 


Ri fGrde— Lf ia) dr . (1 
2 2 


(c) Tf 2 is an I-domain, and if f(x) ts continuous on Q, then f(x) is Lebesque- 
integrable, und Eq. (1) holds. 


Remark 1. Part (a) can be strengthened to read: every /-domain is a Borel set. In (b) and 
{c), one may Imagine f(x) as extended beyond Q, being zero outside of 2, so that f(a) is de- 
fined on the entire F,,. 


Remark 2. The theorem shows that the Lebesgue integral is a substantial generalization of 
the Riemann integral. In the ease of the Riemann integral we considered only bounded do- 
mains and bounded fnnetions. For the Lebesgue integral, arbitrary (Lebesgue-measurable) 
sets are admitted, and the functions may also be unbounded. Absolutely convergent tinprop- 
er Riemann integrals, as referred to in Subsee. 9.2.7., are special cases of Lebesgue inte- 
grals. In this sense, Lemma 9.2.7 makes assertions about the existence of the special Le- 
hesgue integrals Lf jx|*dzand Lf |x|* dz. Reference should also be made to Examples 2 
|z|<1 |v] >1 
and 3 in Subsee. 7.2.1. The /-function referred to in Subsec. 7.2.3. ts a Lebesgue integral. 
If improper Riemann integrals do not converge absolutely, the Lebesgue integrals in 
question need not exist. This is indicated by Example 1 of Subsec. 7.2.1. There oue has 
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uf |= 
Ry 
to introduce improper Lebesgue ee following the proven Riemannian model. 


sin x sin a 


"de does not exist either. Tt is, however, possible 


dzx=-, and hence Ss 


Remark 3. From now on we shall write Lebesgue integrals in the form 


fi(v) da instead of Lf f(x)dx or ff(x) dey. 


14.5.2. Lebesgue’s and Fubini’s Theorems 


Lebesgue’s theorem: Theorems 9.1.3/2 and 9.1.5/1 are precursors to Theorem 
14.2.2. The uniform convergence as well as the boundedness required there for 
the functions immediately ensure the existence of an integrable majorant 
function g(x) in the sense of Theorem 14.2.2. 


Fubini’s theorem: Theorem 9.1.6 now is a very modest special case of Theorem 
14.4.4. Let X=R,, let B=Bz be the o-algebra of the Lebesgue-measurable 
sets in &,, and let u=pj, be the Lebesgue measure on £,,, with analogous assump- 
Hone sconcermine 9 — he, © = 87 andeipe- 7 then Wes.) One 7) — 
=—|Kpo Ou +e. | ie the product meacure space inthe sence ol Eheorem 
14.4.4. If f(a, y), with r¢ R, and y€ R,,, is Lebesgue-integrable in A,,., then 
S fa, y)dxdy= f( f fa, y) dy) dr= f ( f f(x, y) dx) dy . 
Toe Js ie R 


If f(x), x=(a, ..-, t,), is Lebesgue-integrable in #,, then by iteration one finds 


that 
oh Fey PO ee fi cry ne eee la Jl 
a, Ry Ry Ry 

where the parentheses are usually omitted. If f(x) is defined only on a Lebesgue- 
measurable set 2 in R,, then one continues f(x) to &,\Q, setting it zero there. 
Hf the result is a Lebesgue-integrable function on R,,, then the above formula 
can be applied. In this sense one has, for example, 


b(v*) 
ffle)dx= f f fixy, x*) dxdx* 
2 2* a(x*) 


where x«* = (x9, ..., t,), the meaning of the other quantities being as indicated 
ie ie 14. 8. If the functions considered are non-negative, then Theorem 14.4.3 
including its special cases is also applicable in the sense indicated above. 


Fig. 14.8 


14.5.3. Transformation Formulas 


Let 2 be a Lebesgue-measurable set in #,,, and let f(~) be a real-valued func- 
tion defined on 2, then f(x) is continued to #,\Q by setting it zero there. We 
say that f(x) is Lebesgue-measurable or Lebesgue-integrable on Q if the func- 
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tion continued in this way is Lebesgue-measurable or Lebesgue-integrable, 
respectively, on #,. By Theorem 14.5.1, in particular 7-domains are Lebesgue- 
measurable on R,. Now Theorems 9.2.2 and 14.3.4 can be combined, where the 
symbols used have the same meaning as indicated there. 


Theorem. Let y= y(x) be a one-to-one mapping of the I-domain w in R,, onto the 
: : on Mine oiny b 
I-domain Qin R,,, with en (x) +0 for rea. If f(x) is Lebesgue-integrable 
AX I, sre) Un 
on w, and if x= x(y) ts the inverse function of y(x), then g(y)=f(x(y)) is Lebesque- 
integrable on 2, and 


{feo ae= fot) | hn (y) | dy. 


w 


Remark, In the sense of Subsec. 14.3.4., both «7 and 9 are the Lebesgue measures on R,,, and 
T(x) = y(x) (in this case T' is a one-to-one mapping) and f(x) =g(y(x)) =g(T(z)). Comparison 
of the formulas in Theorems 9.2.2 and 14.3.4 gives 


d(T) O(a; ooo 42)! 
fame | tae 


This also presents a concrete example of a Radon-Nikodym derivative. 


(y) 


(1) 


Polar co-ordinates: To illustrate (1), we consider polar co-ordinates. In R, one 
obtains 
C=7 COS, 


a(x, 
OW~<r<o, O0=Sqg<2n and eas 


Or, @) 

As to &, the polar co-ordinates r, #, p have been described in Subsec. 12.4.1. 
One obtains 
Ce) 


O(r, 3, @) 


y=" sm 9, 


=résin?. 


14.6. L, Spaces 


14.6.1. Definition 


In the measure space [X, %, J, we shall consider equivalence classes [/] of 
measurable functions as introduced in Subsec. 14.2.4. 
Definition. (a) For ls p<, L,=L,(X, B, pu) ts the system of all equivalence clas- 
ses of measurable functions such that 
1 


[LAlllz, =f Waa)? dy)? <e2 with h(x) €{f). (1) 

(b) L.=L.(X, B, w) is the system of all equivalence classes of measurable func- 
tions such that 

IfAllz..=ess sup |h(x)l<co with h(x) elf}. (2) 


Here ess sup |h(x)|=inf N, where the infimum is taken over all non-negative num- 
bers N such that w({x | h(x) >N}) =0 (Fig. 14.9). 
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Remark 1. For p=1 this agrees with Def. 14.2.4. It is easily seen that (1) and (2) are inde- 
pendent of the particular selection of the representatives h(x) €[f]. In (2) one only takes the 
“essential supremum” instead of the supremum. Otherwise (2) would not be independent 
of the selection of the representatives. The values of a given function can be altered at choice 
on a set of measure zero. 


Remark 2 (Convention). As indicated in Remark 14.2.4/3 for £1, from now on we shall use 
functions feLy instead of equivalence classes [/] €L, in our ealeulations. We shall also write 


iflz, instead of jlfllz, 


14.6.2. Hoélder’s and Minkowski’s Ine qualities 


Theorem. Let |X, B, u] be a measure space, and let L,=L,(X, B, u) be the spaces 
defined 7n Def. 14.6.1. 


(a) Lf fla)eh, and g{x)€L,, with l=<p<- and <4 = other Hay go jieis 
and the two functions satisfy Hélder’s inequality, 
Slf(z)> g@)l du SAlzllallz,, « 
(b) Lf flwjeL, and g(x)eL,, with l=ps-, then f(x)+g(x)EL,, and the two 
functions satisfy Minkowski’s inequality 
I+ gllz, =liflz, + lll, 


Remark. Corresponding inequalities for complex numbers have been stated in Theorem 6.2.1. 
The above theorem is proved by reducing it to Theorem 6.2.1. 


14.6.3. The Spaces L,(X, &, w) 


Definition. A set M ina Banach space B vs said to be dense tf for every element b€ B 
and for every e=O0 there exists an element me M such that |\b —ml| Se. 
Remark 1. Banach spaces have been considered in Chap. 6; ||-|| denotes the norm. The defi- 


nition says that every element of 4 can be approximated in the norm by elements of JI to 
any accuracy desired. 


Theorem. Let LX, B, wu] be a measure space, und let L,=L,(X, B, wu) be the spuces 
defined in Def. 14.6.1. 
(a) L, with the norm |\fl| Alle, ts @ Banuch space. Here 1S ps. 


(b) For l= p<, the integrable step functions constitute a set dense in L,. 


Remark 2. Part (a) shall include the assertion that Ifilz,, is a norm. As regards integrable 


step functions, we refer to Def. t4.1.1. To prove (b), one may use Theorem 14.2.3/1 and the 
step functions of (13.4.3/1). 
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14.6.4. The Spaces £,(R,) and £,(Q) 


In this section we shall specialize the measure space [X, %, | X= he Se 
is the o-algebra of the Lebesgue measurable sets in R,, and = 1, is the Lebesgue 
measure. 

Definition 1. Let le psoa. 
(a) Ly( Rn) =Lp( Rn, By, 1) in the sense of Def. 14.6.1. 
(b) Lf 2 as a Lebesque-measurable set in Rn, then 


Lp(Q)=tf | feL(Ra), (x)=0 for x€ Rn\Q}. (1) 
Remark 1. We recall the previous convention that we write /¢L, instead of (more precisely) 
[fJ¢L,: (1) has to be understood in this sense. Thus, like L(y), Lp(Q) consists of equiva- 
lence classes of functions that are equal almost everywhere (in the Lebesgue sense). Like- 
wise following previous conventions, we write 

ae 
Ifllz,cay=( S\f(x)|P dx)? for Lap<e 
2 

and 


IIfilz_(2) =e88 sup jf(z)| for poo. 
Teo 


Definition 2. Let Q be adomain in Ry (v.e., an open set in R,). A Lebesqgue-measur- 
able function f(x) ts called a function with bounded support (or a finitely nonzero 


function) (with respect to 2) if supp f={x | f(x)+0} ts bounded and les in Q 
(Fig. 14.10). 


oe Fig. 14.10 


Remark 2. supp / means the support of f. Since 2 is open and supp f is closed and bounded, 
supp / has a positive distance from the boundary 02 of 2. Here the boundary is defined as 
J2- OVO. 


Theorem 1. Lef 2 be a domain in Rp. 

(a) For lspso, L,(2) with the norm IW/llz,,«2) us @ Banach space. 

(b) If l= p<-, then the functions which are continuous and have bounded sup- 
port on 2 constitute a dense set in Lp(S2). 


Remark 3. Part (a) is also true if 2 isanarbitrary Lebesgue-measurable sect. Part (b) is prov- 

ed using Theorem 14.6.3(b). An arbitrary integrable step function is approximated by fune- 
nN: 

tions of the forma Zi %jLQ,(2), where £Q,(«) is the characteristic function of an (n-dimension- 

al) rectangular parallelepiped Q;, with Q;<Q. Subsequently the characteristic function of 

an n-dimensional rectangular parallelepiped Q, with Q CQ, is approximated in an elemen- 

tary way by continuous functions with bounded support. 


Theorem 2. [f 2 is a domain in Ry, and if Lap<o, then the functions with bound- 
ed support that are differentiable on 2 up to arbitrary order constitute a dense set in 
Le) 
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Remark 4. This important theorem is proved using Sobolev’s mollification method. Let 
as, 
cel—i?P for |zj<1, 
oa) = 


0 for \2l=1, ff oe) de=il . 


Ry 
Using Theorem 5.4.5/3 it can easily be shown that @(x) has derivatives up to arbitrary order 
in R,. Further, supp o={y | |y|=1}. For 4>0 one sets w,(x) =h-"w =| (Fig. 14.11). One 
has supp @, ={y | |y| sh}. By Theorem 1(b) it will suffice to approximate continuous func- 
tions with bounded support. If f(z) is such a function, then one scts 
file)= fle) wm, (w@—2) dz, A=O. 
112 


It can now be shown that f,() has derivatives up to arbitrary order on R&,. Further, 
supp /, <2 for small positive h, and f,—>/ in L,(@) (Fig. 14.12). For details, ef. [66], 
pp. 37-40. 


w, (x) 
w(x) 
=i ~h h 7 Fig. 14.11 Fig. 14.12 
15. Complex Function Theory 


15.1. Holomorphic Functions 


15.1.1. The Complex Number Plane C 


A complex number z€C, can be written as z=x+1y (normal representation 
introduced in Subsec. 1.2.4.) or as z=re'” (representation in polar co-ordinates 
as introduced in Subsec. 5.2.4.) (Fig. 15.1). For the following consideratious it will 
be convenient to close C’, by a point at infinity. To thisend we imagine asphere 
placed on the complex number plane C,, touching C, at the point 0 CE Taal 2) 


north pole 
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If this point of contact is the south pole of the sphere, then we draw from its 
north pole a ray to the point z€C,, which pierces the sphere at the point ¢. 
The correspondence is one-to-one: every ¢ on the sphere, except for the north 
pole, is assigned exactly one z€C,. The method is called stereographic projection, 
and the sphere is called Riemann’s sphere. Now this sphere is closed by ineluding 
the north pole. In the z-plane, the north pole is assigned the point oo by the 
stereographic projection, and one writes C=C,U {<} (closure of the complex 
number plane C, by , the point at infinity). Any concepts related to o can then 
be nade transparent by reference to Riemann’s sphere, e.g., the phrase that 
{z| 2€C,, lel>N}U {~} are neighbourhoods of <>. Neighbourhoods of z¢€C, have 


: : ; . ee 
the usual meaning: e.g., circles with the centre z. If z=re'?, then w=—= ee) il 


; 1 z T 
particular then {z| |z|>N}U {-} becomes {w | |ew| < wt . | Here one has he ‘ 


oO 


Thus neighbourhoods of « become neighbourhoods of 0. (Following the con- 
vention made in Subsec. 6.2.2., we set |z|=Vr2+y2, with a= Rez and y=I1m 2). 
This suggests the following method: (i) Concepts of the complex function theory 
are defined initially for points z with |z| <~. (ii) For z= one defines the concept 
in question by starting from the already defined concept for w=0 and substi- 


Pett 
tuting — for w. 
Zz 


15.1.2. Holomorphic Functions 


We shall consider functions f(z) defined on the plane C=C,U {<>}, whose val- 
ues are elements of C, i.e. f(z)€C. If C is imagined as being mapped onto the 
Riemann sphere in a one-to-one way, then this means that we consider functions 
defined on the Riemann sphere, with their values lymg on the Riemann sphere 
too. It is, however, more convenient to refer to the plane C, even though in this 
case the point e must sometimes be given a preferential treatment. Let @ be a 
domain (i.e. an open set) in C (Fig. 15.3). If o€@, then {z| |z|>N}cG@ must be 
true for sufficiently Jarge values of NV. 


Siriaas 


Definition 1. Led f(z) be defined on the domain G in C, v.e., Dif) =G, and let f(z)EC, 
for z€G (v.e., f(z) #0 for 2€G). Then f(z) ts sard to be continuous at the point 2o€G, 
Zoo, af for every e=O there exists a sufficiently small positive number 6=6(e) 
such that |f(z)—f(z)| =e for |jz—z| <6. f(z) as said to be continuous at the point < 
(provided that ~€G) if for every e=+O0 there exists a sufficiently large number 
N=N(e) such that |f(z) —f()| Se for lzel>N. 


Remark 1. Here 6 >0 must be chosen so small that {z | |z—z)| <6} CG. If @€G, then N must 
be chosen so large that {z| |z|>.V}CG. One observes that the continuity at the point is 
shown following the principle set out at the end of Subsec. 15.1.1. 


ial 
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Definition 2. Let f(z) be defined on a domain G in C, D(f)=G; and let f(z)€C, for 
2eG. 
(a) f(z) ts suid to be differentiable at the point 2€G, with 2+, uf the Cy-limit, 
: 1 : ae : , 
om ok (f (29 +h) —flzo)), extsts, which is then denoted by f(z). f(z) ts said to be 
He AU) hy 
differentiable at the point < (provided that -€G@) tf w(z)=f S as differentiable at 
the point z,=0. In this case one sets f’(o) =0. ig 
(b) f(z) ts called holomorphic on G if f(z) vs differentiable at every point z€G. 
Remark 2. Conscquently, f(z) is differentiable at the point 29+, the value of its derivative 
being /’(2), if for every «=>0 there exists a 6=0(e) such that 


Zo th) —flz 
I (%o : {20) f(z) {se for heC, and |hiso. 


Here /’(zo) €C, (Le. that /’(z9) =< is not admitted). This is the complex analogue to the real 
differentiability considered in Subsec. 3.1.1. The rules stated in Subsec. 3.1.2. for real dif- 
ferentiation are also applicable to complex differentiation. In particular, the chain rule of 
Theorem 3.1.2/2 also applies to complex differentiation. 


Z 
Remark 3. The calculating rules for ~ are as follows: z+e=zg—w=%+: o>, and —=U. 
Z 
= for ¢+0 and z+. (Other than in the case of &,. no distinction is made between 
sx and —-«.) Setting f’(-) =0 in part (b) of the definition is natural because, if e(z) =/ (=) 
is differentiable on a neighbourhood of 0, then applying the chain rule one obtains 


\ i 1 
[(2=-w’ (=) go = (0) 0 for ge. 


Lemma. /f f(z) is differentiable at the point z,€G, then f(z) 1s continuous at the 
pont Zo. ; 


Remark 4. The lemma holds especially for =z) €G@. 


15.1.3. Examples of Holomorphie Funetions 


Polynomials: Let ao, ...,a@y be any complex numbers, then the polynomial 
ay HY 


ie) P(2)— > aie" is holomorphic on G—C,—C (22), Onethasy) (@)— > tae = 
k=0 


itive special case f{(z)=dq, f(z) is holomorphic on C, and f’(z)=0. 


1 1 
The funetion —: The function f(z) =— is holomorphic on G=C\{0}, and one has 
2 2 


fa=— a (including z=), 
ee 


Power series: Complex power series have been treated in Subsec. 5.4.1.; the ra- 
dius of convergence has the meaning given in (5.4.1/2). 


oo 


Theorem. Jf the power series P(z)= >) az (z—2 9)", with 29€Cy, has the radius of 
K=0 : 
convergence R, then P(z) is holomorphic on {z|lz—zgl<R}, and P’(z)= 


= RG (g—2,)°—*, 


sl 
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Remark. This is the complex analogue to Theorem 5.4.3/3. The polynomials are a special 
case where the radius of convergence is R=. : 


The function e*: The function e” has been treated in Subsec. 5.4.6. According to 
Theorem 5.4.6 the above theorem is applicable, and it follows that e? is holomor- 


oc 


: 4 a : 
phieton G=€ 7) One has (e7)"— > fa e*. Consequently, the functions a? = e?!™, 
kaQ & 


with @>0, are also holomorphic on O,, and application of the chain rule gives 
(a) —Ina az 


15.1.4. The Cauehy-Riemann Differential Equations, Harmonie Functions 


If f(z) = f(x, y) is defined on a domain G in Cy, and if f(z)€ Cy, then we set f(z) = 
= Re f(z) +i Im f(z) =u(zx, y)+i v(x, y), where u(x, y)=Re f(z) and v(x, y)= 
= Im /(z) are regarded as real-valued functions on Ry (Fig. 15.4). If f(z) is holo- 
morphic on G, then it is easily verified that u(x, y) and v(x, y) are continuous on 
G and have partial derivatives of first order on G with respect to x and y. From 


f(xp thy, Yo + he) — (xo, Yo) 
hyt+ithy 


I’ (4p) =lim » h=hy+ihe, 


One observes that 
of ou . ov 


Of meci | oe, se 
erat! (20), By by nee =if’(2q) . 

By comparison of the real and imaginary parts of the last two equations one 
obtains the Cauchy-Riemann differential equations 


ou 2 Ov Ou Ov 


—— = Sas —_ = = 5 Xy Cr 1 
Oe Gy Gh he (m, y)é (1) 
yi 

sls Fig. 15.4 


Theorem 1. Let G be a domain in C, (=Ry). Further let u(x, y) and v(x, y) be 
continuous and continuously differentiable on G (this means that u(x, y) and v(x, y) 
are continuous on G and that the partial derivatives of u(x, y) and v(x, y) with 
respect tox and y exist and are also continuous on G). Then f(z) =u(x, y) +i v(x, y), 
utth z=x+1y, 1s holomorphic on G tf and only tf tt satisfies the Cauchy- Riemann 
differential equations (1). 


Remark 1. If f(z) is holomorphic on G, then the above preparatory considerations show that 
du du dv ov 
Ox’ ay” dx ay 
exist on G and that the Eqs. (1) are satisfied. Consequently, except for the continuity of 
these partial derivatives, the conditions of the theorem are necessary for f(z) to be 
holomorphic on G. 


u(x, y) and o(z, y) are continuous on @, that the partial derivatives 
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Remark 2 (The functions In z and 2”). Let G be a connected domain in Cy, and let 0¢G (thus 
any two points 2; and 2, can be connected by a smooth curve that lies entirely in @) (Fig. 15.5). 
Further we require that there be no closed path within G on which it is possible to move 
round the point 0. Then. by Def. 5.3.1, 


{(2)=In z=In r +ip (2) 
= 4 
is continuous on G, One has u(x, y)=In r=In Va? + y? and o(x, ¥) =aretan 4 (Fig. 15.6). It 


is now easily verified that the conditions of the theorem are met by u(a, y) and v(x, ¥). 


Hence In zis holomorphic on G, and (In 2)’ =— . The concept of holomorphy can be extended 


os 
ae 


: Ae 

0 Fig. 15.5 0 x Fig. 15.6 
immediately to Riemann surfaces. Then Inz is holomorphic on the Riemann surface de- 
seribed in Subsec. 5.3.1. If w+0 is a complex number, then, using the chain rule, it can be 
shown that 20 — gin j is holomorphic on G. According to Subsec. 5.3.2. the holomorphy can 
again be extended to Riemann surfaces. 


Definition. Let G be a domain in C,. The real-valued function u(x, y) ts called har- 
monic.on G if u(x, y) ts continuous on G, if it has first- and second-order partial 
derivatives on G with respect to x and y, and satisfies on G the differential equa- 
ton 
O-u  7u 
Au=—+=5 
Ox? Oy? 


Theorem 2. Let G be a domain in Cy, and let f(z) be holomorphic on G. If u(x, y)= 
= Re f(z) and v(x, y) = Im f(z) have continuous partial derivatives of second order on : 
G with respect to x and y, then u(x, y) and v(x, y) are harmonic on G. 


=e 


Remark 3. The theorem is a simple implication of (1): 
au Ou ae Pe 
ox? dy? dxdy dydx 
(cf. Theorem 8.1.2). As we shall see later on, u(x, ¥) = Re f(z) and v(2, y) =n f(z) have par- 


tial derivatives of arbitrary order if the function /(2) is ne So the additional as- 
sumption on the second-order derivatives is not oe ‘in the above theorem. 


15.2. Integral Theorems 


15.2.1. Complex Curvilinear Integrals 


Following the lines given in Subsec. 8.1.5., we consider curves€ in the complex 
plane C, which are given by 2(f)=.r(é)+iy(é), where a=f=f. Further, x(¢) and 
y(¢) are assumed to be functions continuously differentiable on [«, 6]. We write 
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2 (t)=.2'(t) +iy'(2). The orientation of the curve is given by the direction of in- 
creasing ¢-values, i.e. A =2(2) and B=2(f) (Fig. 15.7). The curve may be closed, 
1.€. 2(z) =2(8), but shall not have any other double points, i.c., 2(¢,) + 2(ty) for 
%St\<i<f and a<t;<t,=f. 


Definition. 7f f(z) vs a continuous complex-valued function on &, then 


[fe a= ft (¢)) 2°(t) dé 


= Re [/(2() 2] dé +i fim [*(2(¢)) 2’()] de. 


Remark 1. Complex curvilinear integrals f f(z) dz are thus reduced to real integrals over 
c 

intervals [x, 8]. Consequently the properties of these real integrals are transferred to the 

complex ones. e.g., 


4 (Aifi(=) + Aofa(z)) de =A, fi) ) dz +hy fie ) dz. 


where A; and 4, are complex numbers and /,(z) and f,(z) are complex-valued functions con- 
tinuous on ©. 


Remark 2. gi ) dz is independent of the choice of the parameter ¢. If t=1(t) is continuously 


differentiable on [x. f], if r’(t) +0 for t€[«. Bl. and if y=t(x) and 6=7(f) (Fig. 15.8), then 
B 6 


: : dw 
{He Ge id= f toon) FE ar with w(r) =2(t(z)) . 


a pe 


Remark 3. It is also possible to determine f f(z) dz directly asa limit of Riemann sums, by 
€ 


analogy to Def. 3.2.1. To this end one chooses any points 2, €€, with 2, =A and zy =B (Fig. 
15.9). Then 
V-! 


ff(2) dz=lim 2 fle) (2e41—-Ze) > 


where the limit is understood in the sense of increasing fineness of subdivision (in a way anal- 
ogous to that of the Riemann integral). 


Z Vig. 15.9 


Properties of curvilinear integrals. 1. (Fig. 15.10) If € is a smooth curve, and if 


¢ =€,UG,, then 
J fz) dz=_f fz) dz+ f fe) dz 
6 G, 6, 


If ©’ coincides with the curve ©, but has the reverse orientation, then 


J fz) dz= — f f(z) de 
G’ € 


We point out that © denotes the curve including its orientation. 
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2. If #,(z)=f(z) (uniform convergence of 7;(z) to f(z) on @), then 
Jf fz) dz— ff) dz. 
Ly ics 
This can, for instance, be concluded from Theorem 3.2.3. 
Nee 
A A 
yy B 
Yu at 
A Fig. 15.10 A Fig. 15.11 
3. If € is a smooth closed curve (Fig. 15.11), then f f(z) dz does not depend on 
& 
whether the integral fis taken from A to A or from A’ to A’. 
ity 


4. If C=C ,UC,UC3U ... is a piecewise smooth curve with a finite number of 
smooth curve segments ©, ©, ©; etc. (Fig. 15.12), then we set 
f fle) dz= ffle)de+ ffe)de+ ff(e)dz+... 
& Gy ey G3 
The properties stated above remain valid for curves of this type. 


5. Let f(z) be a function holomorphic on a domain G in C,. If © is a smooth 
curve in @ (Fig. 15.13), then 


ee) dz=/(B)—/(4). 


Hence in this case the integral is independent of the path of integration. 


4; Fig. 15.12 Fig. 15.13 


6. Under the conditions of the above definition, one has 
B 
| f f(z) dz] = max f(z) f lel de. 
¢ zee a 


B Bae ener ee meee 
According to Subsec. 9.2.3., f lz’(¢)| dt= f Va’2() +y’2(t) dt is the length of the 
curve ©. G g 


15.2.2. Cauchy’s Integral Theorem 


A domain G in C, is called simply connected if every closed curve © in G can 
continuously be contracted within G into a point (Fig. 15.14). The curve ©’ in 
Fig. 15.15 cannot continuously be contracted within G to a point: hence in this 
case G is not simply connected. 


Theorem (Cauchy’s integral theorem). Let G be a bounded, simply connected do- 
main in O,. Further let ©, and ©, be two piecewise smooth curves in G, extending 
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Fig. 15.14 
from A to B (Fig. 15.16). If f(z) 7s holomorphic on G, then 
J He) dz= f f(z) dz. 
OF G2 


Hise loais5 


Remark 1. Mathematics is art. Surely this is an exaggerated and somewhat biased state- 
inent. But it emphasizes an essential criterion of mathematics, that is the aesthetics of its 
theorems. There are not few mathematicians who regard Cauchy’s integral theorem as the 
most beautiful theorein of mathematics. 


Remark 2. The theorem can be reformulated as follows. Let G again be a bounded simply 
connected domain in C,, and let © be a piecewise smooth, closed curve in G. If f(z) is holo- 
morphie on G, then f f(z) dz=0. 
& 
Remark 3. The requirement that G be simply connected is essential for the theorem. If 
1 1 
G= \z|><\|2|<2;, then@ is not simply connected (Fig. 15.17). f(z) oa is holomorphic on 


G. If ©={e¥, 0=t<27} is the unit circle, then one obtains 


Dine 
1 poe : 
free f eMie' Cli Was. 
& 0 


Hence in this case Cauchy’s integral theorem does not apply. 


Fig. 15.16 Fig. 15.17 


Remark 4 (Fresnel’s integrals and Cornu’s spiral). The theorem can also be used effectively 
in the calculation of real integrals. In geometrical optics, Fresnel’s integrals 
t t 
2(t) =2(t) +iy(t)= f cos v2 dv+i f sin v2 dv 
0 0 
play an important role (Fig. 15.18). What is of interest is their behaviour as f >. One ob- 
tains the picture of Cornu’s spiral (Fig. 15.19). For the calculation of the limits, 
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Cauchy’s integral theorem gives the starting formula 
el?’ dz=0. (1) 
€, UG, UG; ; 
Here ©, and G; are straight line segments, while ©, is a eircular are (Fig. 15.20). One has 
t 


felde= f eidx=a(t) (2) 
G, 0 

ant t 
et d= (e-Ve 4dr. (3) 
G3 0 


hr 


<0 Fig. 15.19 x Fig. 15.20 


.7 t 
a Se P é : - 5 ane 
On ©, we set 2(r)=r-e 4. Then 22=ir?, and we obtain (3). Letting t+ one gets f e~7dr— 


0 


co 


= - 
> | e'dr= 5 (ef. Subsec. 9.2.6.). Tt can also be shown that lim [ eiaz=0. As t+ 

0 a ac, 
in (1), this gives 


15.2.3. Cauchy’s Integral Formula 


Let G be a connected, bounded domain in ©;, its boundary consisting of a 
finite number of components. We choose smooth, closed oriented curves So, ©, 
..., ©, which enclose the respective boundary components (see Fig. 15.21). We re- 
call that, according to Subsec. 15.1.2., curves must not have any double points. 
Now let @p be the outer curve. We connect the curves €,, ..., ©, by smooth curve 
segments with Cy (Fig. 15.22), so that altogether we obtain a piecewise smooth, 
oriented curve @ consisting of Cp, ©,, ..., €, and the connecting segments, which 
are passed through in both directions (by way of exception we admit double 
points here, because the connecting segments have to be counted doubly). If f(z) 
is holomorphic on G, then, by Theorem 15.2.2, f f(z) dz=0. Here the integrals 

€ 
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over the connecting segments cancel each other, and one obtains 
n 


fi” dz Pe il hel d2e— 0. (1) 


This is a generalization of Theorem 15.2.2 for multiply connected domains. 
Theorem (Cauchy's integral formula). Let G be a bounded, simply connected domain 


m C1 and let © be a smooth curve in G (Fig. 15.23). [f f(z) ts holomorphic on G, and 
af z ws a point lying inside the curve ©, then 


Ie ile 
fea | eae. a 
aT J Z2—% 
6G 
Tee, 113).283 
Remark 1. - +, is holomorphic on G\{z}. If A, is a circle of radius ¢ around 2, then (t) can 
Sede 
be applied to — - . giving 
> fl2) > fl) 
—— > Se aed 
| 22 a | Z— 2 ge ey ORE: 
x é€ 


where Remark 15.2.2/3 has been used. In other words. the theorem is obtained from (1) by 
passing to the limit ej 0. 


Remark 2. Formula (2) means that f(z) can be calculated if the values of f(z) are known on a 
curve that encloses 2. This is a very remarkable situation, to which there is no analogue in 
the case of real functions. It also shows that holomorphy is something quite extraordinary. 


15.3. Properties of Holomorphic Functions 
15.3.1. Differentiability and Derivative Formulas 


Theorem. If f(z) 7s holomorphic on a domain G in Cy, then f(z) has derivatives 
(with respect to the complex variable z) up to arbitrary order on G, and the n-th 
derivative f(z) is also holomorphic on G. Let (S be a closed, piecewise smooth curve 
in G, and let the interior of © solely contain points of G (Fig. 15.24). If z 1s a point 
inside of ©, then 


ip =O, Ul, By coe (1) 
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Remark 1. For a holomorphic function f(z), the derivative f(z) has been defined in Def. 
15.1.2/2. If f/(z) is also holomorphic, then one can form f@)(z) = (f/(z))’, ete. f(z) is the n-th 
iteration. Since the complex differentiation with respect to z=x-+iy includes real differen- 
tiation with respect to « and y as special cases, the above theorem implies that /(z)= 
= f(x +iy) =g(x, y) has derivatives up to arbitrary order with respect to x and y. If f(z) is 
once differentiable with respect to the complex variable (and hence holomorphic), then, 
by the above theorem, f(z) has derivatives up to arbitrary order with respect to the 
complex variable. Also for this property there is no analogue in real analysis. 


Remark 2. Formula (1) is derived formally from (15.2.3/2) by differentiation under the sign 
of integration. For n=0 one has f(z) =f(z), and (1) coincides with (15.2.3/2). 


15.3.2. Taylor Series 


Theorem. /} f(z) 7s holomorphic on a domain G in C,, then f(z) can be expanded in an 
absolutely and uniformly convergent power series in the neighbourhood of an arbi- 
trary potnt 29€@: 
2 ; ie) 
= (z—Z_)* d 1 
Iz) 2 axlz—zo)* with ap= ~ Ite C—z)Frt c. (1) 


Here® ts a piecewise smooth curve in G that ee zg, and the interior of © contains 
points of G alone (Fig. 15.25). 


Fig. 15.25 


Remark 1. In Theorem 15.1.3 we had seen that complex power series with a positive radius 
of convergence, #, are holomorphic functions. The above theorem states (at least locally) the 
converse of this assertion: if one chooses any point z)€G, then f(z) can be represented as a 
power series with a positive radius of convergence R in a neighbourhood of zp. R has the 
meaning of (5.4.1/2). Thus holomorphic functions can also be defined as those functions 
which can be expanded in a complex power series with a positive radius of convergence at 
every point of their domain of definition. This is Weierstrass’ approach, whereas the pre- 
vious variant is referred to as Riemann’s approach. 


Remark 2. A point of interest is the magnitude of R, the radius of convergence of the series 
(1). It turns out that (1) conver ges on every disk around zg that lies entirely in G. Thus the 
radius of convergence F of (1) is at least equal to the radius of the largest disk around zp 
that lies in G (Fig. 15.26). 


Remark 8. From (1) and (15.3.1/1) it follows that 


f(z)= SEges (2—=2 9)" - (2) 


k=0 
This is the complex analogue to (5.4.4/1). 


Remark 4. As an example, let us expand In z at the point 1 (Fig. 15.27). According to Re- 
mark 2 and Remark 15.1.4/2, the power series for In z at the point z2g=1 converges on a disk 


1 
of radius 1 around the point 1. From (2), n1=0, (In z)’=— and the expressions for the 
z 
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higher derivatives one obtains 


ket 
kn = a= “ 


(2-1). 


Fig. 15.26 SU” Fig. 15.27 
15.3.3. The Identity Theorem 


Theorem. Let G be a connected domain in Cy, and let {zp }¥_1 be a sequence of points of 

G, with 2+2% for k+l, which has at least one accumulation point in G (Fig. 15.28). 

4 f(z) and He ) are any two functions holomorphic on G, with f(zn)= glee) for 
= 13, ..., hen (z)=9@) ott. 


Fig. 15.28 


Remark. Thus a holomorphic function f(z) on G is uniquely determined by the values /(z;). 


The behaviour in the small (e.g.,in the neighbourhood of the accumulation point of {z,};_1) 
already implies the global behaviour. 


15.3.4. The Maximum Principle 
Theorem. Let f(z) be holomorphic on a connected domain G in Cy. Let Gy be a sub- 


domain of G, and let z)€Gp (Fig. 15.29). If Iflz o)| = max If(z)l, then f(z)=ec ts 
constant on G. 


Fig. 15.29 


Remark, Hence the constants are the only holomorphic functions f(z) on G for which |f(z)| 
may have a local maximum on G. A corresponding assertion is also true for local minima of 


if{z)| (where, of course, |f(z)| =0 must be excluded). 


15.3.5. Liouville’s Theorem 


Theorem. A holomorphic function that is bounded on the complex plane as constant. 


Remark. The theorem is a consequence of (15.3.2/1). Tf one sets 2=0 and ©={¢ | |¢/ = Rh} 
there, one obtains 


cel Wel ae -? gaeheaece and kK=1,2,... 
Hence f(z)=ao 
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15.3.6. The Fundamental Theorem of Algebra 


We prove the assertion made in Remark 5.3.3/4: the polynomial P(z) = >' azz* 
0) 
with the complex coefficients ag, ..., an, dy +0, n=1, has at least one complex 


zero. If we suppose that P(z) has no zero, then is holomorphic on C,. It is 


il 
P(z) 
is also bounded. Then, by Theorem 15.3.5, we find that 


1 
easily seen that | —— 
‘ie P(z) 
Bley i.e. that P(z) is constant, which contradicts the assumption. So 
& 


simple is the matter if adequate mathematical means are at hand. 


15.4. Theory of Singularities 


15.4.1. Laurent’s Series 


Theorem. Jf f(z) is holomorphic on the circular ring {z| R,=<|z—zo|< Ro}, with 
zy€C, and 0O= Ry< Ry< oo (Fig. 15.30), then 


fZ)= > ax (z—29)* with pens [See (1) 
ts 


== 271 = 2) 


Here€ = {w| |wl= R}, Ry=< R= Ry, with the orientation indicated in the figure. For 
a suffrcvently small e>0, the series (1) converges absolutely and uniformly on the 
etrcular ring {z| R,+e<lz—z| = Ry—«}. 


Rp 


x . 
Ry Fig. 15.30 
Kemark 1. The representation (1) is called a Laurent series. One has 


=f) 
Hz)= Dd ap (2 -29)* a 2, Gy, 2 —Z)* 


ee —————— 

regular part main part 
The regular part converges for |z —zp| < fy, whereas the main part converges for |z — 2o| > Ry. 
Remark 2. The following assertion can be derived from the identity theorem 15.3.3: if f(z), 


as given in the above theorem, can be represented as f(z)= 2 6, (z—z)* on the circular 


ring {z | &4<|z—2)|<R}, then 6, =a, for k=0, +1, +2, ... Hence the coefficients a, in (1) 
are unique. In particular this is then also true for power series. 


15.4.2. Singularities 
Definition. Let f(z) be holomorphic on the punctured disk {z | 0<|z—zy| <e}, zy Cj. 
wee > dy (@—z,)" be the corresponding Laurent series. 


eo 


(A) zy ws called a removable singularity if a, =0 for k= —1, —2, —3 


goes 
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(b) zy vs called a pole of order m if Am +0und ap=0 fork=—m—1, —m—2, 
where Gee Wao, 3, ... 

(€) 29 ts called an essential s(ngularity tf 2, 1s neither a removable singularity nor a 
pole. 


wong 


Remark 1. In the case of a removable singularity one simply sects f(zj)=d 9. Then f(z) is 
holomorphic on the disk {z | |z—zp|<e}. The singularity has disappeared, and f(z) = 


= a; (2—<2»)* isa normal expansion in a power serics on {z | |z¢—z|<e}. In the case of a 


k=0 
pole at z one has 


a_m a_em+) a_y 
{2) =— + —_— +... +——-+ Ag+, (2—2) +... 
(2—=29)"  (@—2)""! 4h) 


Tf 2) is an essential singularity. then the main part (cf. Remark 15.4.1/1) contains infinitely 
many terms with non-vanishing coefficients a,. 


Theorem. Let f(z) be a function holomorphic on the punctured disk {z | O<|z— elem 
ZO € C;. 

(a) 29 ts a removable singularity if and only 2f |f(z)| ts bounded in a neighbourhood 
of Zo: 

(b) 29 28 @ pole if and only tf f(z) © as z> Zp. 

(e) (Lheorem of Casorati-Weterstrass). zy is an essential singularity if and only 

af for every e>O0, every d=>0, and every weC, there exists a z€C, such that O< 
lz—zgl<e and |f(z)—wl <6. 
Remark 2. In the older literature, the Casorati- Weierstrass theorem is also referred to as the 
theorem with the three “arbitraries’’: let w€C, be an arbitrary point, then in an arbitrary 
neighbourhood of zp there exist points z whose images f(z) lie arbitrarily close to the point w. 
The values of the function f(z) are thus terribly jumbled up in the neighbourhood of 2p. If, 
on the other hand, 2 is a pole, then it is natural to set f(z) =<. Then zp is a harmless singu- 
larity that can be included in the domain of definition of f. 


Remark 3. So far we assumed that z)€Cy. If z=, then, following the lines of Subsec. 
1 

1S tall, We Seb Ge) =i (=) , considering g(z) in a neighbourhood of 0. If f(z) is holomorphic 
z 


for 2| >.V, then g(z) is holomorphic for 0 <!z! <x" Then 


PS 


and hence 


eel 


regular part main part 


iis a polewt 2, — has the form wie 2”--0a_,,.42-! +... 


15.4.3. Systematic Complex Function Theory, Rational Functions 


If G is a domain in C (that is the complex plane including -), then one may ask 
for the set of all functions f(z) that are holomorphic on G. Or one may ask for 
functions f(z) that have nothing but poles as singularities on G (according to 
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Remark 15.4.2/2, one sets f(zo) = ce at a pole z,). Investigations of this kind are 
the subject of systematic complex function theory. In some simple cases it is 
possible to give final answers. 


Theorem. (a) The constants are the only holomorphic functions on C. 
(b) The rational functionsare the only functions that have only poles as singular- 
ities on C. 


Remark 1. Naturally, the functions admitted to competition in (b) are only those which are 
either holomorphic in the neighbourhood of every point 29€C or have a pole in the sense of 
Remark 15.4.2/1 there. If z9 =~, then see Def. 15.1.2/2 and Remark 15.4.2/3 for comparison. 


Part (a) is in effect a consequence of Theorem 15.3.5. The fact that the rational functions 
n m =I 
R(z) =( pa “2') ( 3, bel) , with complex coefficients a;, 6), and with b,, +0 and n=O, 1, 
k=0 i=0 
2,... and m=0, 1, 2,..., have only poles as singularities on C follows from the partial frac- 
tion decomposition in Theorem 7.1.2, which is also applicable to 2€C, instead of xe Ry. 


Remark 2. Functions f(z) which are holomorphic on C,;=C\{-} are called entire functions. 
Functions /(z) which have only poles as singularities on C, (and are holomorphic in the 
neighbourhood of all other points of C;) are called meromorphic functions. 


15.5. Theory of Residues 


15.5.1. Cauchy’s Residue Theorem 


Theorem. Let G be a simply connected domain in Cy, and let © be a smooth closed 
curve in G (Fig. 15.31). If 24, ..., 2y are points lying rnside of ©, and tf f(z) ts holo- 
morphic on G\{z,}U {2}U...U {ey}, then let 


f= 3 pea ay, Falck). 
as 


5 Se 


Fig. 15.31 
be the local expansions in Laurent series. Then 
ee (3 Ss 
oat {fe dz= > ¢,_1 (residue theorem) . (1) 
“atl l=1 ‘ 
C 
Remark 1. c;,_; is called residue at the point 2). 
: : Choi rons F 
Remark 2. If z, is a pole of order 1, i.e., f(z) = 2 +¢,y +... then it is possible to calcu- 
B= By ; 


late c,_; from ¢;,_,= lim (z—z) f(z). This is the same method as in the partial fraction de- 
ez] 


composition used in Subsec. 7.1.2. 


Remark 3. As in many integral theorems of complex function theory, it can be used in a 
tricky way for the calculation of real integrals (see for instance Remark 15.2.2/4). Let us 


oo i 


: ; ; sin 2 sin x : : 
outline here how the improper Riemann integral i dly=z I da mentioned in 


— oo 


be, 
Gr 
Qr 
ig 
= 
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1 
Subsee. 7.2.1. can be calculated. Sis (e" er") is holomorphic on C,=C\{~} (also at the 


ale 


point z=0). If s=z is real, one obtains 


el@_e—iz eit_e-it ging 


wie wie a 
(The holomorphic function e* has been treated in Subsee. 15.1.3.) From Cauchy’s integral 
theorem, Subsec. 15.2.2., it follows that 
AY : « 2 : 
SNH de 1 els — em 1 eld it e-& 
tere as | as | oe @) 
= c G & 
-N 
=i 
_ =e 0 @ 
-N _ Ly — fie N 
Fig. 15.32 <5 Fig. 15.33 


where € is the path of integration indicated in Fig. 15.33. If ©, and ©, are the semicircles 
indicated in Fig. 15.32, then it follows from the above theorem that 


eit eis 
ee he dz+271- 1, i = le f z dz, 


c € G; 


where 1 is the residue of — sink the pomt 0. One can now estimate that the integrals over 


©, and ©, approach 0 as VN >. Then from (2) one obtains 


oo 


NGL de 
ee 
ag 


— oo 


15.5.2. The Logarithmie Residue 


Theorem. Let 21, ..., Z, be any points in the simply connected domain G in C,. Let 
the function f(z) be holomorphic on G\({z,}U...U {z,}), and let f(z) have a pole of order 
N, at the point z, k=1,..., n. Further let f(z) have a finite number of zeros in G, 
Wy, +0) Wm, Where wy, is a zero of order M,,1=1, ..., m. If the smooth curve © encloses 
all zeros and poles of f(z) (Fig. 15.34), then 


LC) 5 ee 
rai J i de STi Lee (1) 


l=1 k=! 


Remark J. Consequently, f(z) can be expanded at z, in a local Laurent series having the 


form 
1 
1) ~ gaat Ne (Ce, w,+Ck,-wyp+tl@—%) ++), Chew, #O- 
%— ek 
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Fig. 15.34 


2 
Here N,=1,2,... Thus > N; is the sum of the poles, where their multiplicities have been 
k=1 
taken into account. By analogy, in a neighbourhood of w; one has 


M 
H(z) =(z-—w)) (di, + ia, +1 (2—w7) +..-), dium, +9. 


m 

Here M,=1,2,...,and >) M, is the sum of the zeros, with their multiplicities having been 
(p= 

taken into account. From these expansions one obtains 


Ny ie 
f(a) _ arn. -...) near 2, ‘ 
Ke) | ah 
(1+...) near ww). 
Ai; 
At all other points aa is holomorphic. Then (1) follows immediately from (15.5.1/1). 
i f(z) 


The term “logarithmic residue” has its origin in (2), since the residues of (In f(z))’= 
are calculated. 


Remark 2. With the use of (1), the fundamental theorem of algebra can be proved on the 
basis of Theorem 5.3.3/2 (and Subsec. 15.3.6.) as follows. Initially one estimates that all 
f(z) 


n 
zeros of f(z)= >. a,z* lie inside a circle € ={z | |z| = R}. Further one has —~—=— (1+...) for 


k=0 F(z) 


|z| > FR. f(z) has no poles. Now it follows from a ) that 


. we il 


= 


which is the desired result. ‘ 


15.5.3. Mapping Properties of Holomorphie Funetions 


Theorem 1. Let f(z) be holomorphic on a neighbourhood of a point z€C,, and let 
f(z) = wo + Cale — 20)” + engi (2— Apa’ oe 


in that neighbourhood, with ¢,+0. Then there are a neighbourhood U of zy and a 
neighbourhood V of wo, such that f(U)=V (Fig. 15.35). If wyeV, wy+ wo, then 
there are exactly n different points Z, ..., 2, in U, such that f(z) = wy, for k=1, ..., n. 
On the other hand there is no point z€U such that 2+ 2 and f(z)=wWp. 


(Z) (w) 


f(z) 
Vo shy 
Fig. 15.35 
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Remark 1. One has /(U)={w | 32€U such that f(z) =w}. The theorem ean be proved in an 
elegant way by means of Theorem 15.5.2: if one substitutes f(z) —w, for f(z) in (15.5.2/1), 
then the integral counts the number of points where /(z) =w,. Unfortunately the (neces- 
sary) details cannot be dealt with here. 


Theorem 2 (theorem on domain invariance). Let f(z) be holomorphic on a domain G 


in Cy. If f(z) is not constant, then f(G) = {w | 3z€G@ such that f(z)=w} ts also a do- 
main in C, (Fig. 15.36). > 


Remark 2, This theorem is an immediate consequence of Theorem 1. 


(z) (Ww) 


15.5.4. Inverse Functions 


Big. 15.36 


Theorem. Let f(z) be holomorphic in a neighbourhood of a point 2) € Cy. Let f(z) = wo 
and f(z) +0 (Fig. 15.37). Then there exist a neighbourhood U of zy and a neigh- 
bourhood V of wo, such that f(z) maps U onto V in a one-to-one way, the inverse 
function z=g(w) is holomorphic on V, and the relation 


g(w)=——, with w=f(z) 
z 


holds on V. 


Remark. The first part of the theorem is inferred immediately from Theorem 15.5.3/1. The 
second part is the eomplex analogue to Theorem 3.1.4. 


Big. 15.37 


15.6. Holomorphic Continuation 


15.6.1. The Disk-Chain Method 


If f(z) is holomorphic on the disk Ky, and if z,€ Ko, then fo(z) can be expanded 
in a power series at the point 2,: 


fo(2) - 2 ty (2—2))* (1) 


If K, is a disk with centre 2), then the absolute and uniform convergence of (1) 
on A, is ensured if Ac Ay (Theorem 15.3.2 and Remark 15.3.2/2). However, it 


1 


so 
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may happen that the right-hand side of (1) converges absolutely on a disk A, 
that extends beyond Ay (Fig. 15.38). By Theorem 15.1.3, the right-hand side of 
(1) is a holomorphic function on A, also in this case. Thus we obtain a holomor- 
phic continuation of f(z), namely 


fo(2) for zéKy, 
WO DY aie—2,)' “ior eek |. 
k=0 


The iteration of this method is called disk-chain method: along a given smooth 
curve @ in C), one attempts to continue the starting function f,(z) holomorphically 
to a point wef (Fig. 15.39). The continued holomorphic function is again denot- 


ve 


we 
Ky 
5 Ca 
Fig. 15.38 Fig. 15.39 


ed by f(z). Whether or not the point zw will be reached cannot be stated generally. 
Further the question arises as to whether the value /(w) will depend on the partic- 
ular choice of the disks in the disk-chain method. 


Theorem. The disk-chain method applied along the curve € yields a value f(w) that 
vs independent of the particular choice of the disks. 


Remark. More precisely, one should write fg(w) instead of f(w), because in general f(w) 
depends on © (but not on the disks chosen along ©). 


15.6.2. The Monodromy Theorem 


Theorem (monodromy theorem). Suppose that f(z) 1s holomorphic on the disk K 
and that it can be holomorphically continued to w along the curves Gy and ©,, in the 
sense of Subsec. 15.6.1. (Fig. 15.40). Let fi,(w) and fc,(w) be the corresponding val- 
ues. If there exists a family of smooth curves, {€s}y<s<1, which cover the domain-be- 
tween Gy and ©, ina schlicht, gapless, and continuous manner, and ef f(z) can be 
continued holomorphically to w along each of the curves ©, by means of a disk-chain 
method, then fs,(w) = fe, (vw). 

Remark 1. ‘Schlicht, gapless and continuous” means the following: for t+ s, ©, and ©, only 
have the points 2) and w in common (this applies especially also for ©) and ©,); every point 
in the domain between Gp and ©, belongs to one of the curves ©,; and ©, approaches ©, as 
|s —t} becomes small. 


Remark 2. If /(z) can be continued holomorphically from the disk K into the simply con- 
nected domain G by means of the disk-chain method, then the continuation is unique (and 
especially also independent of the path of continuation) (Fig. 15.41). This is a direct impli- 
cation of the above theorem. 


Lye) esr 


Fig. 15.40 Fig. 15.41 
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15.6.3. Riemann Surfaces 


If the domain G referred to in Remark 15.6.2/2 is not simply connected, then 
the continuation by means of the disk-chain method generally depends on the 
path. In particular one may arrive at new values of the function along closed 


paths. We continue f(z) = yz along the unit circle © (Fig. 15.42). The surface of 
existence of /z has been described at the end of Subsec. 5.3.2. On the unit circle 


oes 
one has z=e'” and Yz=e 2. Starting from, say, ¥1=1 (i.e. =0), after a full trav- 
erse of the unit circle by means of the disk-chain method one obtains the value 
i= s 
2 eS —1. This many-valuedness is avoided by introducing the Riemann 
surface for /z in the sense of Subsec. 5.3.2.: two sheets with the “branch points” 
0 and o and a slit that connects these two branch points and along which the two 
sheets are glued to one another (Fig. 15.43). (The path of this slit is irrelevant.) 


Fig. 15.42 y Fig. 15.43 


Other than in the case of the unit circle, the disk-chain method on the Riemann 


surface for /z is independent of the path. This opens the possibility to construct 
Riemann surfaces for arbitrary holomorphic functions f(z). Let f(z) be any func- 
tion holomorphic on a disk A. By means of an optimum utilization of the disk- 
chain method one then obtains the Riemann surface (surface of existence) for 
f(z). Hf these procedures are carried out for {(z) =1n z, starting with a disk around 
z=1 and the value f(1)=In 1=0, one obtains the Riemann surface for In z as 
given in Subsec. 5.3.1. Another example is 


f(z) =V(2— 2) (2-29) (2-23) @—-%), BHA 


It turns out that the corresponding surface of existence consists of 2 sheets with 
the 4 branch points z,, 2, 2, and z, and the glued boundaries indicated in Fig. 
15.44. It is easy to check that the disk-chain method on this Riemann surface for 
f(z) is path-independent. There is still a lot to tell about Riemann surfaces; in 
particular the inexact formulations given above should be made more precise. A 
fine representation has been given in [6]. 


sie 


Zz, 7 
fee % 
300 7 Fig, 15.44 


® 
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15.7. Conformal Mappings 


15.7.1. Fundamental Properties 


Definition. A continuously differentiable, one-to-one mapping u=Uu(x, y), v=v(%, Y) 
of adomain Gin Ry=C, onto a domain G’ in Ry is called conformal if it leaves the 
angles including their ortentations unchanged. 


Remark 1. The meaning is as follows (Fig. 15.45). The continuously differentiable mapping 
u=u%, y), v=v(a, y) maps G onto G’ in a one-to-one way. Consider two continuously dif- 
ferentiable curves ©, and ©, in G, the tangents of which shall include an angle «. Here « is 
considered positive if measured from ©, to GC, (orientation). Now the mapping is called con- 
formal if the tangents of the image curves ©; and © include the same angle « with the same 
orientation. 


Fig. 15.45 reflection Fig. 15.46 


Remark 2. If afand 6 are real numbers, then the translation w=x2+a, v=y+b in Ry is a 
conformal mapping. On the other hand, the reflection «=z, v= —y is not conformal: while 
the magnitude of the angles remains unchanged, their orientation does not (Fig. 15.46). 


Theorem 1. Jf the non-constant holomorphic function f(z) maps the domain G in C, 
onto G’ =f(G) in a one-to-one way, then f(z) ts a conformal mapping. Here G’ is 
again a domain. 


Remark 8. The fact that G’ is again a domain follows from Theorem 15.5.3/2. Further, since 
the mapping /(z) is assumed to be one-to-one, it follows from Theorem i 5.3/1 that f(z) +0 
for every point z€G. 


Theorem 2. Let © be a piecewise smooth, closed curve that encloses a bounded domain 
G in Cy, let z, ..., 2x be any points on C, especially including all of the corner 


points of ©. Let f(z) be holomorphic on G\ U {zp} and continuous onG. If the map- 
k= 


ping ze€— f(z) EW’ = f(C) zs one-to-one, then f(z) effects a conformal mapping of G 
onto the domain G’ enclosed by the curve ©’. 


Remark 4. Consequently {2,, ..., ¢y} contains all corner points of © at which smooth curve 

segments coincide, as well as possible other points on the smooth curve segments. Smooth 

means continuously differentiable. Further we allow © to consist (as indicated in Fig. 15.47) 

of several connected boundary components. The assumption that f(z) be holomorphic on 
NG 


N 
G\ U {z}means that f(z)is holomorphic in a neighbourhood U(z9) of every point zy€G\ U {zz}. 
p= ll peil 
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Remark 5. The value of the theorem consists in that one has only to consider the images of 
the boundaries. 


{’ Fig. 15.47 


15.7.2. Riemann’s Mapping Theorem 


Lemma. /f f(z) is a holomorphic function that effects a conformal mapping of the 
unit disk {z| |z|<1} onto itself, and if f(0)=0 and f'(0)>0, then f(z) is the iden- 
tical mapping. 


Remark 1. What is required above is that f(z) maps the unit disk K = {z | |z| <1} conformally 
onto /(K)=K, satisfying the additional conditions that f(0)=0 and f/(0) +0. The assertion 
is that f(z) =z is the only mapping that has these properties. This is a first, essential step 
towards determining all conformal mappings f(z) that map A onto itself; cf. Theorem 
15.8.7. 


Remark 2. In connection with the above lemma, the following problem arises: which do- 
mains G in C can be mapped one-to-one onto the unit disk by means of holomorphic func- 
tions? Now let -€G@ be admitted. If G is a domain in C,, then Theorem 15.7.1/1 shows that 
this mapping is conformal in the sense indicated above. Therefore in the case of arbitrary 
domains G in C one also speaks of conformal mappings, which means one-to-one mappings by 
holomorphic functions. 


Theorem (Riemann’s mapping theorem). Let G be a simply connected domain in C 
that hes at least two different boundary points. Further let zy€G, z+, and 0S9< 
<2n. Then there exists exactly one conformal mapping f(z) of G onto K = {z | |z|<1}, 
such that f(z) =0-and f’ (zo) =|f’(2o)l el”. 


Remark 3. Consequently, f(@) =K, where one may additionally prescribe which point 2p + © 
of @ shall be mapped onto 0 and what shall be the value of arg /’(zp). The uniqueness asser- 
tion is easily derived from the above lemma. On the other hand, the proof that there exists a 
conformal mapping with the desired properties proves difficult. 


Remark 4. Since G has at least two different boundary points z and z, and is simply con- 
nected, there is also a curve from 2 to 2; that consists of nothing but boundary points. If, 
for instance, G=C\G, where € is a curve from zp to z, (see Fig. 15.48), then this is a simply 
connected domain, because every closed curve in @ can be contracted to a point, if neces- 
sary, via the point ~ (for reference, see the corresponding situation on the Ricmann sphere 
(Fig. 15.49)). There arc only two types of simply connected domains which are not covered, 


% “ Wig. 15.48 [es _/ Fig. 15.49 
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namely the extended plane C and the punctured extended plane C\{z}. It is easily seen that 
C\{zo} can be mapped conformally onto C\{<}=C;. Further it is relatively easy to see that 
neither C nor C, can be mapped conformally onto K ={z | |z| <1}. After all, it is not pos- 
sible either to map C conformally onto C;. Thus the class of all simply connected domains in 
C can be subdivided into three conformal types: K, C;, and C. 


15.8. Linear Transformations 
15.8.1. Conformal Mappings of C and C, 


At the end of Subsec. 15.7.2. we stated that for simply connected domains in C 
there are three conformal types, namely the unit disk K, the complex plane 
C,, and the extended plane OC. Thus it is of interest to determine the class of all 
conformal mappings that map C onto itself, C, onto itself, and K onto itself. 
We shall return to the case of the unit disk in Subsec. 15.8.7. 


Theorem. (a) f(z) 7s a@ conformal mapping of O onto itself if and only if either f(z) = 


a 
= +¢ or f(z) =az+b. Here a, b, and c are complex numbers, and a+ 0. 
me — 


(b) f(z) 28 a conformal mapping of O, onto itself if and only if f(z) =az+b. Here a 
and b are complex numbers, and a+0. 


rl 


15.8.2. The Group of Linear Transformations 


A conformal mapping of C onto itself is called linear transformation (more pre- 
cisely: bilinear transformation). From Theorem 15.8.1 (a) it is concluded that 


Lz) is a linear transformation if and only if 
Lz) ——. with ad—be+0. (1) 


Tf Lea, with ad—be=0 and |c|+|d|/+0, then ZL(z) is constant and of no 
4 


interest for our purposes. 


Theorent. (a) The linear mappings constitute a group with the identical mapping as 
us wdentity element and the law of composition LyLy: (L,L5)(z) = EL (La(z)), 2€C, Ly 
and Ly linear transformations. 

(b) Every linear transformation can be represented as a finite product of special 
linear transformations of the form 


Lz)=z2+b, In(z)=az and Ine) ==. 


Here a and b are complex numbers, and a+0. 


Remark. In general, £,L.+ LL, which means that the group is not commutative. 


15.8.3. lnvarianee of Cireles 


Theorem. The class of ull straight Wnes and circles in C is mapped onto etself by a 
linear transformation. 


15.8.5, 15.8. Linear Transformations 157 


7 
Remark 1. Tf one regards straight lines as special circles that pass through , then the 
above theorem can also be formulated as follows: under a linear transformation, a circle 


‘ : 1 
is mapped onto another circle. The transformation L(z)=— maps a circle passing through 
the origin onto a straight line, and vice versa. fi 


Remark 2. The theorem needs only to be proved for the three standard types of Theorem 
15.8.2(b); the rest is done by Theorem 15.8.2(b). 


15.8.4. Mapping Properties and Cross Ratios 


Theorem 1. Let 21, 22, 23 be three different points in C. Further let W , Wy, Ws be three 
different points in C. Then there exists exactly one linear transformation L(z) such 
that L(zx)=wy for k=1, 2, 3. 


Remark 1. Thus one is free to prescribe 3 counter images and 3 image points. If all 6 points 
are different from o. then w=L(z), with 


ZZ, 23-2qp WW, Wz—Wy 


an (1) 


= 4 83-2, W—We W3-W; 


is the desired linear transformation. 


Theorem 2 (‘rvariance of the cross ratio). [f L(z) is a near transformation, and 
if we= Lz) for k=1, 2, 3, 4, where the points z, are pairwise different, then 


UE W3— Wo 24-24 23 — 29 


(2) 


Remark 2. (2) is obtained from (1) and the uniqueness proposition of Theorem 1. If one of 


(C85 55) W3— Wy, 20 —E9) 23-24 


co 


the points is ~, then = is treated in the usual way in the calculation, setting —=1. 


co 


15.8.5. Fixed Points and Types of Mappings 


Theorem. A /inear transformation other than the identical one has either exactly one 
fixed point or exactly two fixed points. 


Remark. What is desired are points z€C such that = L(z) =z. Such a point is called 


cz+d 
fixed point. Here o is admitted to competition. For the identical transformation L(z) =z, 
cf course, every point z€C is a fixed point. [f D(z) is not the identical transformation. then 
az+b : 
the equation —— =z has either onc or two solutions. 
cz+d 
Classification by types: If the linear transformation L,(z) has exactly one fixed 
point and if this fixed point is , then L(z)=z2+5, where b is a complex number, 
b+0. If the linear transformation /,(z) has cxactly two fixed points and if 0 and 
oo are these fixed points, then L,(z)=az, where a is a complex number, a +0, and 
a+1. Hence the two standard types are 


ij@=2+6 and Liizj=ae, where 6+0, a+0, a=1. (1) 


If L(z) is an arbitrary linear transformation that is not the identical one, then 
there exists a linear transformation / such that 


either se ii ols! or = 1h-!. (2) 
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1-1 is the transformation inverse to J. L has exactly one fixed point in the first 
case and exactly two fixed points in the second (Lp and L, have the meaning of 


(1)). 


Fig. 15.50 


Parabolic pencil of circles (Fig. 15.50): Lo(z)=z2+5 is a translation. When Ly is 
applied iteratively, the points z in C move along the solid lines, where the ortho- 
gonal dashed lines indicate the distances between counter image and image points. 
Hence the solid lines are the trajectories, and the dashed lines indicate the step 
interval in an iterative application of Lo. When / is applied, this picture is trans- 
formed into a parabolic pencil of circles: this follows from the invariance of cir- 
cles and the conformity. If L has exactly one fixed point, then the first formula in 
(2) can be used: the solid circles are then the trajectories, and the broken circles 
indicate the step interval in an iterative application of £2. The pencil point at 
which all the circles coincide is the fixed point. 


Elliptic-hyperbolie peneil of ecireles (Fig. 15.51): If one considers L,(z)=az, 
where a >0, one obtains a dilatation. The trajectories are the straight lines pass- 
ing through the origin, and the concentric circles around the origin indicate 
the step interval. If a=el”, g real, then L, is a rotation. One obtains the same 
picture, except that now the circles are the trajectories, while the straight lines 
indicate the step interval. Application of a linear transformation / gives an 
elliptic-hyperbolic pencil of circles. The hyperbolic pencil of circles consists of all 
circles passing through the points zy and z, (corresponding to the straight lines 
passing through 0). The elliptic pencil of circles consists of all circles orthogonal 
to the former ones (corresponding to the circles with centre 0). If L has exactly 
two fixed points, then the second formula of (2) can be used. zg and z, then are 
the fixed points. If in the corresponding standard transformation either a>0 
or a=e'?, then the circles of the elliptic-hyperbolic pencil are the corresponding 
trajectories. In the general case, L,(z)=az, with a=|al e”, is a rotation and 
stretching. Such mappings are also called loxodromic transformations. 
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15.8.6. Sehwarz’s Reflection Principle 


If A isa circle with the centre a€C, and the radius R>0, then the points zo 
and z, are called mirror points (with respect to A) if z3 a= Roel’, z, a = Rye’, 
and RyR,= Rk? (Fig. 15.52). For straight lines, mirror points have the usual 


meaning. 
re 


Fig. 15.52 
Theorem. Under linear transformations, mirror points are transformed into mirror 


points. 


Remark. Since circles are transformed into circles under linear transformations, the asser- 
tion of the theorem is meaningful. (As usual, we consider straight lines to be special cir- 
cles.) 


15.8.7. Conformal Mappings of the Unit Disk 


To supplement Theorem 15.8.1, let us determine the conformal mappings of 
the unit disk K = {z | |z| <1} onto itself. 


Theorem. f(z) 7s a conformal mapping of K onto itself if and only tf f(z) =e"? - a, ; 
where a€C,, lal<1, O= 9 <2r. se 
Remark 1. It is relatively easy to reduce the theorem to Theorem 15.8.6 and Lemma 15.7.2: 
let f(z) be ae mapping aoe onto itself, with f(a) =0 and a+0. Hf h(z) =, then 
h(a) =0, (7) =o, Since a and q are mirror points with respect to the unit circle, 0 and = 


must be mirror points with respect to the image of the unit circle. But this is only 
possible if this image is a circle with centre 0. From |h(1)|=1 it follows that this image is 
again the unit circle. Hence /(z) maps the unit disk onto itself. If g(z) =f(h-(z)), then g(z) isa 
conformal mapping of K onto itself, and g(0)=0. The rest now follows from Lemma 15.7.2. 
Remark 2. The transformations of the unit disk onto itself constitute a subgroup of the 
group of all linear transformations. 


15.9. Special Functions 
15.9.1. The Funetions e* and In 2 


The holomorphic function e? has been dealt with in Subsecs. 15.1.3. and 5.4.6. 
For the holomorphic function In z we refer to Subsecs. 15.1.4., 5.3.1. and 15.3.2. 
The known properties of the real-valued functions e” and In x can be transferred 
to e? and In z: for arbitrary complex numbers z,; and z) and z+0 one has 


ett2—e%e2 and eZ=z2. (1) 


Here it is irrelevant from which branch the value of In z is taken: one has 
(In z), = (In z)o+27ik and e’"*— 1. Relations of the form (1) can be proved in a 
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simple way: /(z)=e!™’—z is holomorphic, and f(x) =0 for x real, x +0. Then the 
identity theorem 15.3.3 implies that /(z)=0. In other words, identities for real 
functions can be extended to complex functions, provided that the functions 
considered are holomorphic. 


Mapping properties (Fig. 15.53): From e?=e*el” for z=x+iy, it is easy to infer 
the following mapping properties: e* maps the strip region {z | 0<Im z<2z} con- 
formally onto the plane slit along the positive semi-axis. e* maps the strip region 
{z|0<Imz<zx} conformally onto the upper half-plane. Here the marked 
straight lines are transformed into the corresponding straight lines and circles. 


Mie pai 
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ing 0 moe Fig. 15.53 


15.9.2. The Functions sin z, cos 2, tan z, and cot 2 


We extend the expansions in power series for sin 2 and cos x, as described in 
Subsec. 5.4.5., from R, to the complex plane. Thus, for z€C, let 


sin 2=2- — SS = ee an cos z=1-— aS An 
a ae hv Ta aN ae aes 


8 Zz 


sin 2 : : 
as well as tan z= and eot z=— . The radius of convergence of both series 


COS 2 sin 2 
is co (cf. Subsec. 5.4.1.). Then it follows from Theorem 15.1.3 that sin z and 
cos z are holomorphic functions on Cj. 


Theorem. All the propositions of Theorem 5.2.1 remain true if one substitutes 
z€C, for «ER, and we€C, for ye R,. 


Remark. As was indicated in Subsec. 15.9.1., the theorem is proved by using Theorem 15.3.3 
and the corresponding propositions for real z and w. Further, for all z€C, one has 


sin 2=5 (e202 coe :=5 (el + ei?) , 
Thus, for instance, sini = (e~1-e). 
eiz = ez 
Mapping properties (Fig. 15.54): From tan z= —i Aeieaaie and the mapping pro- 


perties of e* it follows that tan z maps the strip region \? | [Re 2 <7 conform- 
ally onto the unit disk. 
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15.9.3. Partial Fraction Decomposition for cot z 


From the power series expansions for sin z and cos z it follows that cot z= 


COS z . 
— has the Laurent expansion 
sin z 
ou 
22 
1—-—=+... 
2 re 
Cr 2= ts SF ooo (1) 
z 3 


at the point 0. Since cot z has the period x, the term is the main part of 


the Laurent series at the point kx. Here / is an integer. Apart from these poles of 
order one, cot z has no other poles. 


1 = 1 1 : 
Theorem 1. cot z=—+ + —— ], and this series converges uniformly 
z poile—An 2zt+kn 


tn the neighbourhood of every point z that 1s not a pole. 


Remark 1. This is an infinite analogue to the partial fraction decomposition of Subsec. 
7.1.2. Thus one simply collects all the main parts, where convergence is achieved by combin- 
ing those main parts which contain kz and —kz. 


Theorem 2. > mo (2) 


2 2a? 


I 
? ~ ! 1 ~ 22 ; : : 
Remark 2. If > = = 2 ———— is expanded in a power series at 0, then 
this gives : k=! 


aban (3) 


Comparison with (1) leads immediately to (2). In the same way it is possible to compare the 
ee 


coefficients of 23, .. in (1) and (3). Thus one obtains 


os 


e jl at el m 
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16. Principles of Hydrodynamics of Plane Flows 


16.1. The Fundamental Equations of Hydrodynamics 


16.1.1. Preliminary Remarks on Modeling 


In the next three subsections we shall take the following line to formulate a 
mathematical model for the physical problem of plane flows. 

1. By heuristic considerations we attempt to find mathematical propositions 
which characterize the physical problem. In our case this is tantamount to find- 
ing formulas that describe when a plane flow is source-free and circulation- 
free (Subsec. 16.1.2.). 

2. An intramathematical formulation of the problem resulting from these 
investigations: the real formulation of the fundamental equations of hydrodynam- 
ics (Subsec. 16.1.3.). 

3. Intramathematical reformulation of the problem, looking out for as ‘efficient 
a mathematical set of tools as possible: complex formulation of the fundamental 
equations of hydrodynamics (Subsec. 16.1.3.). 

4, Axiomatization in the sense of Subsec. 12.1.2. The mathematical theory 
takes the lead (in our case it is very simple); physics is connected to it by trans- 
lation and interpretation (Subsec. 16.1.4.). 


16.1.2. Souree-Free and Cireulation-Free Flows 


We shall consider flows that can approximately be regarded as plane flows, 
e.g. flows at a water surface or atmospheric flows in which one dimension can be 
neglected (Fig. 16.1). The flowing particles move at a certain velocity. Their 
trajectories are called streamlines. In the x, y-plane they can be characterized 


by the velocity vector w(x, y)= (p(x, y), g(x, y)) (Fig. 16.2). Then |w| =p? + ¢? is 


G 
W 
4 ie 
é x 
Fig. 16.1 Fig. 16.2 


dy _ g(a, 7 


dx es. 

streamline. This formulation already rene a first, essential assumption: 
we only consider steady-state plane flows, i.e. flows that are independent of 
time. Further we impose the conditions that these flows have no sources, sinks, 


defines the direction of the 


the magnitude of the velocity, and 
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or vortices. Consider a simply connected domain 2 with a smooth boundary @2 
that is fully included in the flow region G (Fig. 16.3). The domain @ is said to be 
free of sources and sinks if the amount of liquid (or air ete.) that flows into 2 per 
unit of time is equal to the amount flowing out of 2 within this unit of time. Let 
ds be the element of length of 02, and let » be the exterior normal. Then the 
amount flowing through ds per unit time is c- ds - cos (y, w) lwl=cd sv: w, 
where ¢ is a factor of proportionality (Fig. 16.4). w,=»- w is the normal compo- 
nent of w. Hence freedom of sources and sinks means that /f w,ds =0 for every 
a2 


domain 2 of this type. One has »=(cos (v, x), cos (», y)), and hence w,= 
=p cos (v, x)+q cos (», y). Then it follows from Theorem 9.3.1/2 that 


op @ 
i (eo) drdy=0. As this equation must hold for every admissible domain 


2 
Q, one finally obtains 


Op(x, q(x, 
p(x w g(x Yo 


a of in the flow region G. (1) 
JE : An av 
4 
ae 
a 2 ds _ Ww 
Fig. 16.3 ds-cos (vw) Fig. 16.4 


The vector 4=(—cos (», y), cos (v, x)) is orthogonal to », and hence it is the nor- 
malized tangent vector to the curve 02. Then w,=w-t= —p cos (», y) + cos (», x) 
is the tangential component of w. Hence the property of local freedom of circu- 


lation means that fwds=0 for every admissible domain 2. From the 
0Q 
above representation of w, and from Theorem 9.3.1/1 it then follows that 


le ~) dzdy=0. As this equation must hold for every admissible domain 
y ox 
Q, one finally obtains 


Coles) = eq(, ¥) 0) in the flow region G. (ep 


oy ox 


16.1.3. Real and Complex Fundamental Equations 


Fundamental equations (real variant). Let p(x, y) and q(x, y) be real functions 
that are continuous and continuously differentiable on a domain G in fy, where 


they satisfy the conditions 
. : opt: ? nar) 
Op, 9), 240% Y)_ 9 ang we y) _@g(ts ¥) _ 9 (1) 
Ox a] oy Ox 
The streamlines of the corresponding source-, sink-, and circulation-free plane 
flow are the solutions of the differential equation p(x, y) dy—q(x, y) dx=0. The 
velocity vector of the flow at the point (2, y)€G@ is w(x, y)=(plx, y), (7, ¥))- 
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dy 9(z, y) : ; 
Remark 1. Tf p(«, y) +0, then =-=—-—_. According to the existence and uniqueness 
dz p(z, y) 
theorems for ordinary differential equations as stated in Chap. 4, then for every point 
(29. Yo) CG there exists (at least locally) a unique streamline that passes through this point. 
From (1) it follows that pdy—qda =0 is an exact differential equation as defined in Subsec. 


10.1.4. 


The stream potential. The above formulation reduces the problem to the solu- 
tion of ordinary exact differential equations. Here the role played by the condi- 
tions (1) is not quite clear. It shall be attempted to find a formulation in which 
the conditions (1) are taken into consideration in a natural way. Let again @ bea 
domain in Ry, and let Py be a point fixed in G@ (Fig. 16.5). If y is a continuously 
differentiable curve in G that connects Py with P~ (xr, y)€G@, then we set 


dé dy, 
u(x, y)= [ (oe. 7) qs t 7) =) ds , (2) 
fe 
; d dé 
vin f (ven F208, 9) ds, (3) 
Fig. 16.5 


where y is given by (&(s), 7(s)), with the are length s as a parameter. eas), 
o(x, y) and f(z) = f(x +iy)=u(x, y) +iv(z, y) are called velocity potential, stream 
function, and stream potential, respectively. 


Theorem 1. Let G be a simply connected domain in Ry. Then u(x, y) and v(x, y) are 
puth-independent and satisfy the Cauchy- Riemann differential equations on G: 


Ou : Ov ou ov 


oe a, Oe y) and A Tees y) - () 


f(z) ts holomorphic on G. 


Remark 2. u(x, y) and v(x, y) are independent of the path y. Hence, if in Eqs. (2) and (3) one 
replaces the curve y by a different curve y’ (see Fig. 16.5), then the values of u(x, y) and 
v(x, y) are left unchanged. This also justifies the notation used (otherwise in (2) and (3) we 
should have written, say, uw, and v, instead of wu and », respectively). For the proof of the 
path-independence, Eqs. (1) are required in a natural way. A problem of the same kind has 
been dealt with in Subsec. 10.1.4. The fact that f(z) is holomorphic if the conditions (4) 
are satisfied is a consequence of Theorem 15.1.4/1. 


Theorem 2. With the assumptions of Theorem 1, w(x, y)=/'(z) is the velocity 
vector. The streamlines are determined by v(x, y)=c. 


Remark 38. The first proposition follows from 


7 du, dv ‘ 

(Z) =p —i apnea ; 

where w(x, y) is uow represented as a complex number whose real and LMaginary parts are 
the respective components of the vector w(a, y). The second proposition can be deduced from 
Theorem 10.1.4, where the function denoted by M(x, y) there coincides with the above func- 
tion 2(a, y) (ef. Remark 1). 
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Fundamental equations (complex variant): Let @ be a simply connected domain 
in Ry, and let p(x, y) and q(x, y) be the functions considered in the real variant 
of the fundamental equations. If v(x, y) and /(z) have the meaning indicated 
above, then the streamlines can be calculated from v(x, y)=c, and the velocity 


vector is given by w(r, y)=/'(z). 


Remark 4. What is annoying in this version is the restrictive condition that @ be simply 
connected. 


16.1.4. The Mathematieal Model 


Axiom. If G is an arbitrary domain in Ry, then a source-, sink-, and locally circula- 
tion-free flow is described by a function f(z) that ts holomorphic on G. The streamlines 


can be calculated from Im f(z) =c, and the velocity vector vs given by w(x, y)=f (2). 


Remark 1. Hence, if /(z)=u(z, y)+iv(z, y), then w(z, Y) =(p(z, y), g(a, y)), where p= 


== =o and q= 3%. From this it follows that 
dp oq op aq 
—+—=—_—— = G@ . 
dx dy dy az 


ov 
Thus the conditions (16.1.3/1) are satisfied. From v(x, y)=c one obtains Or ie da + 


v 


+5 dy=p dy—q dz. Thus we have achieved a full correspondence to the real variant 


of the fundamental equations of Subsec. 16.1.3. Moreover we have got rid of the awkward 
constraint of the eomplex variant of the fundamental equations, namely that G be simply 
connected. 


The model: In the sense of Subsec. 12.1.2., we thus have the following situation 
(Fig. 16.6): the mathematical theory consists in determining the level curves 
c=v(xr, y)=Im f(z) of holomorphic functions f(z). Given an actual flow, then 
there arises the following translational problem: find (or guess) the holomorphic 
stream potential /(z). The interpretation is then clear: the level curves of Im f(z) 


are the streamlines, and w(x, y)=/’(z) is the velocity. 


F(Z) f (2) 
He | ge eet ol 
inter — 
pretatian 
v(x,y) streamlines 
w velocity Fig. 16.6 


Remark 2. We proceed here as follows. First we investigate simple holomorphic functions, 
thus providing for a stock of elementary flow patterns. By combining such basic flow pat- 
terns we will then arrive at more complicated flow types that give a mathematical descrip- 


tion of actual flows which are of physical interest. 
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16.2. Flow Regimes 
16.2.1. Stagnation Point Flow and Multipole Current Flow 


Parallel flow (Fig. 16.7): For a=«+if complex, /(z)=az is regarded as a stream 
potential. One has /’(z) =a and Im f(z) = 2+ ay. The flow in question is a parallel 


flow with the streamlines Bx+ay=c and the velocity /’(z)=d. The flow region 
can be chosen to be G=C;,. 


SS, 


‘ig. 16.7 


Stagnation point flow (Fig. 16.8): Let the stream potential be f(z)=2”, n =2, 3, ... 
Then in polar co-ordinates one has f(z)=r"%el”? and f’(z)=nr"-!e7@-Di?, The 
streamlines are given by r” sin np=c. One obtains a stagnation point flow with 
G = C',\{0} as its flow region. 


Fig. 16.8 


Multipole current flow (Fig. 16.9): Let the stream potential be /(z)=27”, 
where n=1, 2, 3,... In polar co-ordinates one has f(z)=r7"e7i"?, and f’(z)=. 
=n @ ele, The streamlines are given by r?=c sin ng. One obtains a 
multipole current flow with G=C,\{0} as its flow region. 


Dipole current flow (Fig. 16.10): A special case of interest is that of a multipole 
1 

current flow of type f(z)=—, called dipole current flow. The streamlines are 
2 


given by r=c sin 9. One obtains «2 + y?=r2=er sin p=cy. Hence the streamlines 
are a parabolic pencil of circles (cf. Subsec. 15.8.5.). 


Fig. 16.9 Fig. 16.10 
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16.2.2. Swirl Flow Regimes 


Sources and sinks: Let the stream potential be f(z)=In z. In polar co-ordinates 


this means f(z)=Inr+ip and }’(z)=—e'!?. The streamlines are given by y=c, 
r 


i.e., the straight lines passing through the origin. One obtains a source (Fig. 16.11) 
with G=C,\{0} as its flow region. If one starts from f(z)=—Inz, then /’(z) = 


he eee 
= e?. One obtains a sink (Fig. 16.12) with @=C,\{0} as its flow region. 


source Fig. 16.11 sink Pig. 16.12 


Remark. The flow regimes considered have been assumed to be free of sources and sinks. 
This is only a local statement that applies to a neighbourhood of an arbitrary point 2€G 
and does not contradict the above examples (0 does not belong to the flow region). 


Vortices (Fig. 16.13): Let the stream potential be f(z)=ia ln z, where a is a real 


: : — ia , 
number. In polar co-ordinates one has f(z)=ia In r—ag and /’(z)= —— e'”. The 
streamlines are given by r=c, i.e., they are circles with centre 0. : 


vortex Fig. 16.13 


Swirls (Fig. 16.14): It is possible to combine sources and sinks with vortices. 
Here the actual combination corresponds to the combination of the stream po- 
tential. Let the stream potential be f(z)=(«+i8) nz, where a and @ are real 
numbers, 6+0. Then the streamlines are given by 6 lnr+agy=c, which gives 


In r=5-3 vy, ge R,, that means logarithmic spirals. 
source vortex swirl Fig. 16.14 


16.2.3. Profile Flow Regimes 


The hitherto considered elementary flow regimes exhibit only points as singular- 
ities. The question as to how does a flow respond to an obstacle, or what is the 
pattern of flow around a given profile, is of much greater interest (from the phys- 


13* 
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ical point of view). This is closely connected with the following question: what 
happens when additional sources or sinks are put into a given flow? The latter 


that is by combination of suitable elementary flow regimes and of the correspond- 
ing stream potentials. 


Parallel flow with a source (Fig. 16.15): Consider a parallel flow (in C,) which is 
disturbed by a source (at the point 0), for instance the discharge of chemical 
sewage into our Saale river, which might become too clear otherwise. Intuitively, 
one should expect something like the situation shown in the figure. As the 
parallel flow and the source are known to have the stream potentials z and In z, 
respectively, it suggests itself to set the combined stream potential to be 
f(z)=z4+lnz. If x, y are Cartesian co-ordinates and r, g are polar co-ordinates, 
then for the streamlines one obtains Im (z2+I1n z)=y+ =e. Let -z<p=n. One 
obtains the pattern of Fig. 16.15. The interesting limiting curve is y+q=7, or 


fe é : ll 
y+o= —n. It joins the x-axis at —1 at right angles. One has f’(z)=1 ee and 


hence f(—1)=0, which means that —1 is a stagnation point. But the above 
pattern can also be regarded as a profile flow, namely the flow around the profile 
described by the curve y+y=7, or y+ y= —n. The stream potential is f{(z)= 
=z+In z, and it is only the external part of the pattern that is of interest in 
this case. 


parallel flow aatince 


F(z)=z fiz)=inz fiz)=ztinz 
Fig. 16.15 


Flow around a cirele (Fig. 16.16): Consider a parallel flow (in C,) and a dipole 
flow with 0 as a singular point. One obtains the situation shown in the figure. 


Is i 
The streamlines are given by (r-=] sin p=c in polar co-ordinates. Hence the 
. 


unit circle, r=1, is a streamline with c=0. If OS p<z and c<0, then r<1 
(distorted dipole flow). If Os <x and c>0, then r+1, that is a flow around the 


eg. 1 
unit circle. The result for —n<g<0 is analogous. From f’(z)=1—=0 one 
obtains z= +1 as stagnation points. is 


Se f(zi-$ TZ) = 29 
Fig. 16.16 
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vortex 
7 


fizj=z+ F(z) =falnz 
(a>0) ea a Os 


Fig. 16.17 


Stagnation point flows (Fig. 16.17): We consider the flow around the unit 
circle mentioned in the last example (where now the streamlines inside the 
circle are of no interest), and combine it with a vortex (with 0 as a singular 


ae if 
point). The stream potential is f(z)=z+—+ialnz, where a>0. In polar co- 
z 


: 1} 
ordinates, the streamlines are given by (7 — al sin g+alnr=c. In particular, 
e 


the unit circle r=1 is a streamline. Thus one again obtains a flow around the 
unit circle. The stagnation points resulting from /’(z)=0 are of interest. From 


1 ia 
f@=f— 5 o a 0 one obtains 


Here three cases can be distinguished, where it is clearly seen that the effect 
of the vortex increases with increasing a@ (Fig. 16.18). 


eee 
q 47 g ee JE) 
Phe 


02a<2 a=2 a>2 
Fig. 16.18 


16.2.4. Conformal Mappings in Hydrodynamics, Zukovskij Profiles 


Conformal Mapping: Let f(z) be the stream potential in a flow region G (Fig. 
16.19). The streamlines are given by Im f(z)=c, the stagnation points can be 
obtained from /’(z)=0. If w=w/(z) is a conformal mapping of G onto G’, then we 
consider the transformed stream potential g(w) =/f(z(w)). From Im g(w) = Inn f(z) 
it follows that the images of streamlines are again streamlines. From g’(w)= 
= f'(z(w)) - 2’(w) and 2’(w) +0 it follows that the images of stagnation points are 
also stagnation points. In other words, conformal mappings transforin the 
flow pattern as a whole. It is clear that on this basis it is possible to construct a 
great number of new flow patterns from the examples treated so far. 
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Liz * (if; 


Fig. 16.19 


Zukovskij profiles: Our aim is to describe the flow around lifting surfaces 
(e.g. of airplanes). For that purpose we use the principle of conformal mapping 
as described above. Before doing so, let us investigate some conformal mappings 


(Fig. 16.20). The linear mapping v=) = maps the exterior of the 
z 


Fig. 16.20 


circle in the z-plane, as indicated in the figure, onto a half-plane, which in its 


1 _ 
turn is mapped onto a slit plane by w=v2. Finally, the linear mapping w= il a 


= 
yields an image which is a plane slit by a circular arc. The combined map- 


1 1 
ping is w=> (:++) . We now consider a flow around a circle A as described in 
2 z 
Subsec. 16.2.3., where A touches the broken circle at the point 1 (Fig. 16.21). 
: ee 1 Ie 
The image of K obtained by application of w= > (:+=] is drop-shaped. Here 
2 z 


the angle ¢ and the circle K can be varied. One then obtains slender and thick 


(u) 
¢ : 
u-4(z+2) i 


Fig. 16.21 
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Ve 
an Fig. 16.22 


profiles, as well as prolate and greatly curved ones. According to the principle of 
conformal mapping, u = +2] maps streamlines onto streamlines. Thus one 
obtains patterns of filone arcmin lifting surfaces (Fig. 16.22). As the stream 
potential f(z) is explicitly known for the flow around a circle [a suitable linear 
transformation of +5] , the stream potential g(w) =/(z), with w =5 (-+=) , 18 


also explicitly known for the profile of the lifting surface. For lifting surfaces, 
the question for the lft A=X+iJ} is of special interest. It obeys Blasius’ 
formula 


Y= =a jg dz , 
0 


where & is the curve flown around and c is a material constant. 


17. Elements of Geometry 


17.1. Geometry of Space Curves in Rs 


17.1.1. The Moving Trihedral 
Curves in R, have been dealt with in Subsec. 8.1.5. Now let » =3. We consider 
the space curves 
ax(u) =(24(u), xo(u), r3(z)) 


where uw is a parameter, u€[a, b], ~-o<a<b<~—. 


Ye 
Uy 
%3 


Va i 
xX(u) ‘ 


Fig. 17.1 
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For functions x,(w) that are continuously differentiable, we had determined 
the are length ie a(u) in Subsec. 9.2.3. We found that (Fig. 17. 1) 


Care) ae dary (a2) me das ay q 
Or Oe 
f Z| dy dy 
Apart from fe initial point and the orientation of the curve, s is unique. Hence 
s 1s a natural parameter that we shall use from now on. We shall denote deriv- 


atives with respect to s by a prime “”’’. In particular one has 
i 


1 dae ay 1) (d3)" “2 
af == : aS -— = il, Be 3) 
eE\3) du ( du (52 a du sal 


Thus, according to Subsec. 8.1.5., for the tangent f=¢(s) to the curve x(s) one 
obtains 
{= (24(s), 25(8), 23(8)) with le] =1. 


If ¢’(s) +0, then 
ns) = 


lé’(s)| 
is the normal, and 
b(s) =¢(s) x n(s) 
is the binormal. From ¢: FG (t- ja (1)’=0 it follows that n is orthogonal 


to ¢. 6 is orthogonal to ¢ and n, and further |n|=|¢)/=1. Combining the three 
vectors, we have an orthonormal right-handed trihedral (¢, n b,), called the 
moving trihedral (of the curve) (Fig. 17.2). Here we are using well-known nota- 
tions of analytical geometry: if a=(a;, a), a3) and b=(b4, by, bs) are vectors in 
Ry, then a» b=a4by +a 9b) +a3b3 denotes the scalar product, and a Xb =(aob3— 
— 43), 436;—a4b3, ayby—agb,) denotes the vector product, which is orthogonal 
to both a and 6. (Fig. 17.3). We recall the mnemonic rule that the components 


* * * 
of a xb are the subdeterminants of |a, a ay| (including the signs). 
b, by bg 
a p axb 
b 
Fig. 17.2 a lene, 033 


17.1.2. Frenet’s Formulae 


Theorem. Let x(s) bea space curve in Rs that 7s three times continuously differenti- 
able, with s as a parameter, s)<s<s). Further let x’’(s) 0 for sy<s<s,. Then there 
exist a positive function x(s) and a real function t(s) such that 


= xn, 


n= —xt +1b, sy<s<s, (Frenet’s formulae) 
b= —tn, 
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Remark 1. As ¢(s), (s), b(s) span the R5, one has, for example, ¢’(s) =a,(s) (8) +a@o(s) n(s) + 
+a,(s) 6(s), with analogous relations for n’(s) and b’(s). From the orthogonality of t, n, b and 
the above construction it then follows that the coefficients must be as given in the theorem. 


Remark 2. If 2”(s) +0 does not hold for the entire curve, then it is decomposed into curve 
segments for which either x’(s) +0 for all s or x(s)=0 for all s. If #’(s) =0, then a(s) isa 
straight line segment. 


Remark 3. x(s) is called the (first) curvature, t(s) the torsion (second curvature) of a space 
curve. From 


t(S) —t(s 
ve(82) (89) =t"(¢0) = Jim =e 


it follows that x(s) is a measure of the deviation of the curve 2(s) from a straight line (Fig. 
17.4). ¢ and n span the “‘osculating plane” (or plane of curvature). As we shall see in Subsec. 
17.1.3., t(s) =0 if and only if x(s) extends within a fixed plane, which is then the osculating 
plane for all s. In the general case, t(s) indicates to what extent the curve turns off the oscu- 
lating plane. 


t(s,) t{s,) rt) 
a 


n t(s,) 


x(s) Fig. 17.4 


Remark 4. Consider a circle 


§ _ & 
x,(s)=R cos aD %o(s)= RFR sin PR? 


x3(8) =0, where s is the arc length. 


1 
It is readily verified that x(s) aR and t(s) =0. 


17.1.3. Plane Curves 


Theorem. A curve x(s) in Ry which is three times continuously differentiable and 
for which x’'(s)+0 extends in a fixed plane if and only tf t(s)=0. (Here s is the 
are length, and 8)<s~<8}). 

Remark 1. Confer Remark 17.1.2/3. 


Remark 2 (evolute of a plane curve; Figs. 17.5 and 17.6). For plane curves a(s), Frenet’s 
formulae reduce to 


t=xn and n’=—xt. 
1 
In accordance with Remark 17.1.2/4 we call A(s) sa) the radius of curvature. The evo- 
lute y(s) of the curve x(s), i 
R n(s) 

y(s) =2(s) + R(s) n(s) =2(9) + 
is the locus of the so-called centres of curvature. From Frenet’s formulae it follows that 
y(s) =x(s) + S pe a me n. This implies that y(s) is also the envelope of the normal. 

Pa x2 x2 


provided that x’(s) +0. 
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x5) 


(5s) 


17.1.4. Existence and Uniqueness Theorem 


Theorem. Let x(s) and t(s) be functions continuously differentiable for sy<s<sy. 
Further let x(s)+0. Then there exists a space curve x(s) in Rs such'that s is the 
are length, z(s) us the curvature, and t(s) the torsion of the curve. This space curve 
x(s) is unique in R,, except for congruent transformations. 

Remark. If any two space curves x(s) and %(s) have identical curvature and identical torsion, 
then &(s) can be transformed into 2(s) by a translation combined with a rotation in R3. The 
intrinsic geometry of the curve is thus well-determined by x(s) and t(s). Therefore x(s) and 
t(s) are also called the natural equations of the curve. 


17.2. Hyperbolic Geometry 


17.2.1. Principles of Axiomatic Geometries 


About 325 B.C., Euclid, in his ““Elementa”’, made an attempt at an axiomatic 
formulation of plane geometry, including the concepts of points, straight lines, 
and circles. Later on these axiomatic geometries were substantially refined, 
including new aspects (by D. Hilbert about the turn of our century). The aim 
is to put a few propositions (axioms) at the top and derive all other propositions 
from them by purely logical deduction, without having recourse to intuition. 
Even the fundamental concepts are defined axiomatically, i.e. with no content. 
This reads, for instance, as follows. The elements P of a set M are called points. 
Certain subsets g of M are called straight lines. Now definitional statements 
are made on the interrelations between points and straight lines as well as 
between straight lines and straight lines, etc. We shall give three examples 
below. 

1. If g, and g, are two straight lines, then either g,=g» or g,Qg.=9, or there 
exists exactly one point P such that Pég,N gq. 

2. If P,; and Py, are two different points, P,;+ P,, then there exists exactly 
one straight line g such that P,€g and P,€g. ° 

3. (Euclid’s parallel axiom). Hf g, is a straight line and P is a point such that 
P¢q, then there exists exactly one straight line g) such that Pé€g, and g,Qg.=9. 

This is not an exhaustive enumeration, but a few typical examples. The 
parallel axiom played a decisive role in the history of mathematics. It is much 
more complicated than Euclid’s other axioms. It has been attempted to derive 
this axiom from the other ones. Not earlier than in the first half of the 19th 
century, Gauss, J. Bolyai, and Lobatéevskij found out that Euclid’s parallel 
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axiom was not derivable from the other axioms of Euclid. For it can be replaced 
by either of the two following parallel axioms 3’ or 3” (keeping all the other 
axioms of Euclid), which yields a consistent geometry in either case: 

3’. If g, isa straight line, and if P isa point not of g,, then there does not exist 
any straight line g. such that Pégs and g,M gq. =O (elliptic geometry). 

3”. If g, isa straight line, and if Pisa point not of g,, then there exist infinitely 
many straight lines gy such that P €g and gig. =0 (hyperbolic geometry). 

What is desired are models of such geometries, and the purpose of the consid- 
erations in the following subsections will be to describe the so-called conformal 
model of hyperbolic geometry. 

Another problem involves the measurable quantities in a goemetry. In 
Euclid’s plane geometry, these quantities are, for example, lengths, areas, 
angles etc. These quantities are invariant under congruent transformations 
of the plane, i.e., under translations and rotations. This simple statement has 
been generalized substantially by F. Klein in his “Erlanger Programm” in 
1872. It defines geometry as the theory of invariants of a transformation group. 
Hence, what is given is a group of transformations defined on a certain funda- 
mental set M. The only quantities to be regarded as geometrically admissible 
are those which are left unchanged under every transformation of the group. 
We shall pursue here this aspect, too. 


17.2.2. A Model of Hyperbolic Geometry 


Following the notation of Subsec. 17.2.1., we define: M = {z| |z|<1} is the 
unit disk in the complex plane. The points P are the usual points of the unit 
disk. The straight lines g are the circles orthogonal to the unit circle, i.e., those 
circles which intersect the unit circle at right angles (Fig. 17.7). The transfor- 
mation group consists of all conformal mappings of the unit circle onto itself. 
These are the linear transformations referred to in Theorem 15.8.7. According 
to Theorem 15.8.3, linear transformations map circles onto circles. Since the 
unit circle is mapped onto itself and angles are left unchanged under the linear 
transformations of Theorem 15.8.7, orthogonal circles are transformed into 
orthogonal circles. In other words, the transformation group maps the above 
points P again onto points and the above straight lines onto straight lines 
(as it should also be expected of a reasonable geometry with a transformation 


group). 
SS x) 
a, 


Fig. 17.7 I,°R"? Wig. 17.8 


Theorem. (a) Zf g, and go are any two straight lines, then erther gy = go or gi! go =), 
or there exists exactly one point P such that Peg, gy (Fig. 17.8). 

(b) If P,and Py are any two different points, then there exists exactly one straight 
line g such that P\ég and Py€q. 

(¢) If g, ts a straight line, and if P isa point not of gy, then there exist infinitely 
many straight Lines gy such that P€gy and gi go= (parallels) (Hise (29): 


i 
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Remark. These are Axioms 1,2 and 3” of Subsec. 17.2.1. 


Ky) parallels to 


RY g, through P 
Fig. 17.9 


17.2.3. Distanees, Angles, and Triangles 


Definition. (a) Jf z and z, are any two different points of M, and if Cy and Cy 
are the points of intersection of the connecting straight line with the unit circle, then 
1, So-% $1—2o 
d(zq, Z;) == In -——— + —— (1) 
2 Z2=> rea years 
us the (hyperbolic) distance of z and 2, (Fig. 17.10). 
(b) The (hyperbolic) angle between two straight lines g, and gy is equal to the 
Euclidean angle between the associated tangents (Fig. 17.11). 
(e) Triangles are formed by three straight line segments (Fig. 17.12). 


55 


Sr length Wig. 17.10 angle Fig. 17.11 


triangle Fig. 17.12 


1 
Remark 1. d(z, 2,) is = In of a cross ratio in the sense of Theorem 15.8.4/2. That theorem 


also shows that the cross ratio is invariant under linear transformations (and hence especially 


: : Coz —2 

under the above transformations). It can now easily be shown that = 0 ae >1. Thus 
o—% S12 

d(z, 24) is a positive number that is invariant under the transformation group admitted. We 

set d(zq, 2;) =0 for 29 =2,. Since the mappings of the transformation group are conformal, 


part (b) of the definition is also meaningful. 


Remark 2. It is seen immediately that d(zp, 21) =d(z,, zo). If we regard 2, as fixed and let 
zy>Co, it follows that d(zo, z,) +: so the straight lines are of infinite length. Consequently, 
an “inhabitant” of the unit disk who commands a hyperbolic perception lives in an infinite 
world. 


Theorem. (a) /f 2, 2, and z are three points on a straight line, where z 1 lies be- 
tween z and 2, then d(zq, 2) =a (zy, 2) +d (21, 29) (Fig. 17.13). 
. &(0, 2) 
(b) him pe ile 
(e) Lf a, Band y are the interior angles of a triangle, then 0Sa+B+y~<n (Fig. 
17.14). 
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Fig. 17.13 a+ hry =0 T-lu+Bty)sMall Wie, 17.14 


Remark 3. (b) means that at the point 0 the conditions are approximately Euclidean. 


Remark 4, Other than in the Euclidean casc, the sum of the three angles «, 8, and y of a 
triangle can assume every value from (and including) 0 to (and excluding) z. 


17.2.4.  Cireles 

Definition. Zf 29¢ M and r=0, then K={z|z€M, d(z,z)=7} is a (hyperbolic) 
circle. z ts called the centre of Kk. 

Remark 1. As M is an infinite world, one obtains a circle for every z9¢ J/ and every r>0. 


Theorem. The (hyperbolic) circles with centre z, are the common Euclidean circles 


Be 


in M which belong to the elliptic pencil of circles with the centres z and 
(iio. 14.15). 


Fig. 17.15 


Remark 2. The elliptic pencil of circles (with the centres 2) and z,) has been described in 
Subsec. 15.8.5. 2) and — are mirror points with respect to the unit circle in the sense of 
Be . 


Theorem 15.8.6. Then it is relatively easy to show that, in particular, the unit circle belongs 


to the elliptic pencil of circles with the centres 2) and —. 
*0 


17.2.5. Are Length and Area 


The (hyperbolic) length of continuously differentiable curves © in M is deter- 
mined by the same method as in Subsec. 9.2.3.: approximation of the curve by 
polygons, where a polygon consists of (hyperbolic) straight line segments (1.e., 
of segments of circles that are orthogonal to the unit circle), the ends of which 
lie on ©. The supremum of the lengths of these polygons is the length / of the 
curve & (Fig. 17.16). 

Theorem 1. A continuously differentiable curve in M with the parametric represen- 
8 
8 . 
tation 2(s8), 8 =S=81, hus the (hyperbolic) length L= leecar where s is the 
Euclidean length of arc. fs : 
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Fig. 17.16 Fig. 17.17 


dad 
Remark 1. The formula suggests defining qc as an infinitesimal element of area in J/. 
Theorem 2. Jf B vs a (hyperbolic) bounded domain in M, the boundary of which 
consists of piecewise smooth curves, then the (hyperbolic) area of this domain 
(Fig. 17.17) 7s given by 


Remark 2. This formula can also be obtained by first defining the area of (hyperbolic) 
rectangles (formed by elliptic-hyperbolic pencils of circles), and then exhausting an arbi- 
trary surface by rectangles of this kind. 


Remark 3. The area of J is infinite, because 


= oor 


fl 
dady s [ rdr 
J (dar)? 


oh NE 
Osiz|<! 


17.2.6. Area of Triangles 


Theorem. Jf B ts a triangle with the angles x, B and y, then | B| ee =) 
(Fig. 17.18). 4 


- 


Fig. 17.18 


Remark 1. In Theorem 17.2.3 we had already found that «+f+y<7. 


Remark 2. Many properties in hyperbolic geometry are analogous to corresponding proper- 
ties in Euclidean geometry. The theorem shows, however, that there may also be entirely 
new effects which have no counterpart in Euclidean geometry. 


17.3. Geometry of the Hilbert Space 
17.3.1. Hilbert Spaces 
Definition 1. Let H be a complex linear space. A scalar product on H assigus « 


complex number (x1, %) with the following properties to ervery ordered pair 
ry CH and x€H: 
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1. (Ayry + Aare, 2g) =Ay(29, £3) + Ay(wo, 25) . 
on Gti Ae (ra, 21) 
3. (x, ce) >0 for x#0, where one sets |||] =V(x, 2). 


Here 2, to, x3 are elements Oy H, and 2, ay are complex numbers. 


Remark t. The concept of the complex linear space has been explained in Subsec. 6.1.1. 


Theorem, (a) (x, 0)=(0, x)=0 for all x€H (in particular, ||0|| =0 
) Tf x4, x2, 3 are elements of H, and if Ay, Ay are complex numbers, then 


(20g) Apry t+ Ayre) = A(x, 14) + Ay(xy, Xp) - 
(c) (Schwarz’s tnequality). For x«,;€H and x¢€H, 
(rp o)l Sllzyll [frat - 


Here \(x,, ty)! =||x4l| ||xol| ¢f and only if x,=0 or xy =0 or x)= Axo. 

(d) ||2 || %s @ norm. 

(e) Lf |lxzn—2x||--0 as n+, then |lxp||>|lx||. Zf, moreover |lyn—y||~0 as n>, 
then (an, Yn) (x, y). 


Remark 2. Thus |\«|| satisfies the conditions of a norm in the sense of Definition 6.1.1/1. 
Therefore a complex linear space with a scalar product is also a normed linear space. From 
now on we will use the terminology for normed linear spaces, which will always refer to 
|z|| in the sense of the above definition. 


Definition 2. A complex linear space H with a scalar product is called Hilbert 
space if it 7s complete and if there exists a countably infinite, dense subset in H. 


Remark 3. Consequently, in order that a complex linear space H with a scalar product be 
called a Hilbert space, two conditions must be satisfied: 

1. H must be a complex Banach space in the sense of Def. 6.1.1/2. 

2. There must be a set {xj}; 1 CH that is dense in #1. 

The conditions for a set to be dense in a Banach space have been described in Def. 14.6.3. 
It is customary not to include the last requirement in the definition of a Hilbert space; 
accordingly, a Hilbert space is a special complex Banach space whose norm ||| is derived 
from a scalar product in the sense described above. A Banach space JB is called separable if 
there exists a dense set {x;};— ; C B. In the sense of the usual terminology, we have thus defin- 
ed by Def. 2 under which conditions a complex linear space H is a separable Hilbert space. 
However, since Hilbert spaces other than separable ones are of no interest for us here, we 
directly chose the above formulation. 


Remark 4. We restrict ourselves to complex spaces here. All the definitions are, however, 
also valid for real linear spaces. In particular one may also speak of real (separable) Hilbert 
spaces. 


17.3.2. Examples of Hilbert Spaces 


The space Cy: If r=(xy, ..., m2) € Cn and y=(y,, ---) Yn) € Cn, then one sets 
nm 
tae y) = XjYj as 
This is a scalar product, and one has C, =/5.¢ in the sense of Subsec. 6.2.2. It 
is easily seen that {r le= (x), .-. a), Re xg and Im x; rational} is a countable, 
dense set in C,. Hence C, is a Hilbert space. 
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The space lz: We set 1,=l.¢ in the sense of subsec. 6.2.2. If r= (a1, xo, ...) El, 
and y=(y4, Yo, «--) Elo, then 


(t Y= D2 aii) 


a 
is a scalar product. According to Schwarz’s inequality one has |(x, y)|= 
lI llvla<cc, so that the product formation is meaningful. (In this case 
Schwarz’s inequality can also be derived from Hélder’s inequality referred to 
in Subsec. 6.2.1.) The vectors r=(a,..., xy, 0,0,...), where Re x, and Im a, 
are rational, are dense in /,. Hence dy is a Hilbert space. 


The space £2(8): For 2 being a domain in R,, we had introduced the real space 
L,(2) in Subsec. 14.6.4. In the same way it is possible to define the complex 
space L,(Q): It is the system of all complex-valued functions /(2) (more pre- 
cisely: of equivalence classes of functions) such that |f(x)| belongs to the real 
space Lo(Q). As in Subsec. 14.6.4. one has 

t 


Wfllzacoy=(f LAP dzx)2, (1) 


We make the convention that from now on £5(2) shall always denote the com- 
plex space L,(2). The properties of the real space L,(Q) are also valid for the 
(complex) space L,(2). In particular, L,(2) with the norm (1) is a Banach space. 
(1) is derived here from the scalar product 


(f, 9) E42) = f(x) g(x) da. 


Theorem 14.6.2 shows that (f,g)z,e) is meaningful. With the use of Theorem 
14.6.3 it can be shown that the set 


N 
if | Tae riLQ(®), 7 complex rational, (2) 
j= 


Q; being a cube with rational corners, Q;c a 


is dense in L,(Q). “r; complex rational” means that Re 7; as well as Im7r; are 
rational. zg,(x) is the characteristic function of the cube Q;. “Rational corners” 


means that the corners of Q; have the form x= (a), ..., xn), x rational. This set is 
countable. Hence L,(Q2) is a Hilbert space. 


Remark. In the same way it is possible to define complex spaces L,(Q) with 1=p=~—. If 
i=p<-, then (2) is a dense set, and one obtains separable complex Banach spaces. Cor- 
respondingly, the real spaces L,(Q) of Subsec. 14.6.4. are real separable Banach spaces. 


17.3.3. Orthogonal Systems 


Definition. Let H be a Hilbert space. 

(a) Two elements uc H and véH are called orthogonal if (u, v) =0; u¢H 7s said to 
be normalized cf |lul|=1. 

(b) A (finite or countably infinite) system {uj}; is called orthonormal tt ie) — 0 
for j+k and |\uj|=1. If eC A, then a,=(x, uz), k= 1, 2, ..., are called the Fourter 
coefficients (of x with respect to {u;}). 
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(¢) An orthonormal system {u;})<1,2,... ts called complete (or closed) if («, u;)=0 for 
j=1, 2,.... of and only Ges 0. 


Remark 1. H=C,, is a special, finite-dimensional Hilbert space. In this case u=(w, ..., Un) 


n 

and v=(r4,..., %) are orthogonal if and only if the (complex) scalar product > u;o;=0.If 

a al 
{uj ja is orthonorial, then in this case every element «¢H can be represented as 

n 
a= Z Oj; . 
J=1 
. . w 

From this it follows that (2, w,)= > a j(%;, Up) =a,, Which means that the «,’s are the 
5 : ome j=l j 
Fourier coefficients. A special complete orthonormal system in C, is uj;=(0, ..., 0, 1, 0, ... 0), 
j=1....,. In other words, the above concepts represent the beginning of an analytical 


geometry in C,. The Hilbert spaces to be considered here will in general be infinite-dimen- 
sional. i.e., different from C,,. Thus the considerations to follow can be regarded as an ana- 
lytical geometry in the infinite-dimensional (complex) Hilbert space. 


Remark 2. If «=0, then (a, u;) =0, i-e., in one direction the condition in part (c) of the above 
definition is always satisfied. This raises the question as to whether there exist complete 
orthonormal systems in an arbitrary Hilbert space. In the case H=C,, this is clearly shown 
by Remark 1. 


Theorem. Let {uj} be an orthonormal system in the Hilbert spuce H. Further let 
ap=(x, uz) be the Fourier coefficients of xc H. 
(a) For every x€H, 


> lal2=||a|!? (Bessel’s inequality) . 
i 


n 
Further, { > zu} is a Cauchy sequence in H. 
i Sik flee 
(b) > ajuj converges to x if and only if 
=n 
> lajl2=|lal|2 (Parseval’s equation) . 
J 


(c) {uj} 2s complete if and only if Parseval’s equation holds for every xe Hl. 
Remark 3. If {w;} = {w;}j=1 is an infinite orthonormal system (which is the normal case), then 


>= > in Bessel’s inequality as well as in Parseval’s equation. Then the convergence in (b) 
j Heel BS . . eye 

must be understood in the sense of letting n >. It is clear low the propositions are to be 
understood in the case of finite systems (and especially in the case of finite-dimensional 


Hilbert spaces). 
Remark 4. In the space Jy, wu?) =(0, ..., 0, 1, 0, ...), j=1, 2, 3, ..., is a complete orthonormal 
: j 


system. If 7 =(21, %9, ...) Ela, then 2; =(x. u) are the Fourier coefficients of « with respect 
to this system. ; 

Remark §. From (b) and (c) one obtains the following proposition. The orthonormal sys- 
tem {u;} is complete if and only if the (finite) linear combinations of the elements w, ta, «.. 
form a dense set in II. 
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17.3.4. Schmidt’s Orthogonalization Process 


Theorem. In every Hilbert space there exist complete orthonormal systems. 


Remark. The proof of this theorem is based on Schmidt’s orthogonalization process. Onc 
starts with a countable, dense sct {uj}ja1, where it is allowed to assume that 2;+0, and sets 


X4 ., 
v4 fel If v’5 =a — (ag, 04), +0, then one sets vy = , and v,=0 otherwise. Then one sets 
4 


v4 
2 > Pl 


04 = 25 — (25, 04)¥4 —(24 Vg)¥q, and v3=—, for v3+0, vg=0 otherwise. Iteration yields a 


lle a 


system v1, v9, ¥3, ... Finally, omitting those v;’s which are zero, one obtains a complete 
orthonormal system. 


17.3.5. Orthogonal Decompositions 


The space C, is spanned by the orthonormal elements (0, ..., 0, 1, OO; , Oe Pur 
ther, C, can be represented as an orthogonal sum C, = Cy, ®Cy,, with n=n, +n. 
This raises the question as to whether this is feasible in arbitrary Hilbert spaces. 
Theorem 17.3.4 provides the necessary basis for investigations of this kind. 


Definition 1. A set M in a Hilbert space ts called convex if x6 M, ye M and 0St=1 
implies that tr +(1—t)ye M (Fig. 17.19). 


ixt(-by 


EaA 
ta a » 
0 ling, 17 18) 
Lemma. Every nonvoid closed convex set in a Hilbert space contains exactly one ele- 
ment with a minimum norm. 


Remark 1. If J7 is convex and closed, then the lemma says that there exists a z¢ M7 such 
that |jzl|= us \z||. This is plausible but not trivial. 
TEL 


Definition 2. (a) A linear subset of a Hilbert space ts called a subspace. 
(b) If H, is a subspace of a Hilbert space H, then H} ={y| y¢H, (x, y)=0 for 
all x€ H} ts called the orthogonal complement (to H,). 


Remark 2, Thus H, is a subspace if Aw +my¢€H, for every 2¢€H, and y€H, and arbitrary 
complex numbers A and ys. In particular, one thus has 0¢ H,. From the linearity of the scalar 
product it follows immediately that Hj is a subspace, too. By analogy with the case of C, 
(or R,), the question that now arises is whether H, and H, span the entire space H. 


Theorem. If 1, 1s a closed subspace of a Hilbert space H, then every element x€ H 
can be uniquely represented as 
T=2i1-e,, where CH, lant Va. oH 
(Fig. 17.20). 
Remark 3. The hyperplane M={y | y=2+2z, z€ H,} is a closed convex set (Fig. 17.21). Let 


xj, be the element with the minimum norm as referred to in the above lemma. Then 2 =a, + 
+a; yields the desired decomposition. A corresponding conclusion in Rz is intuitively clear. 
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Many propositions in the Hilbert space are simply the counterpart of corresponding gco- 
metric facts in Rs. A minor difficulty is involved by the fact that in general Hilbert spaces 
are not finite-dimensional. Thus, for instance, the above theorem is not true if Hy, is not 


closed. (In a finite-dimension al space, every subspace is closed, so that this difficulty does 
not occur.) 


Fig. 17.21 


Remark 4. The theorem can also be written as 
H=H,®H; (orthogonal sum) , 


and # is said to be spanned by H, and H;-. It is easily seen that H, and H> are again Hilbert 
spaces (according to our definition, it is necessary to prove their separability for that pur- 
pose). Now the above procedure can be repeated to represent Hj} as an orthogonal sum of 
two Hilbert spaces Hy and H,: H> =H ,@H3. Combination with the above formula gives 


H=H,® HH. Hs. 


Iteration yields 


n 
H=H,@420...0H,=>0H, . 
k=) 


n 
To this there corresponds a unique decomposition «=> a,, and in particular one obtains 
=! 


(xg, ;) =0 for k+l. Conversely, if Hy, ..., H, are closed, pairwise orthogonal subspaces of a 
nr 
Hilbert space H, then it is clear what is meant by >\@ H;, even if the space spanned is not 
all n 
the whole space #: the set of all elements x€ 4 that can be represented by x= >, x,, where 


v,€H,. li Hy, Hy, ... are closed, pairwise orthogonal subspaces of a Hilbert space H, then 


2 n 
let >) @ /F, be the smallest closed subset of H that includes all spaces >@H,, n=1, 2, 3, ... 
k=1 k=1 


Remark 5. If H, is a closed subspace of a Hilbert space H, then (Hj)+ =H, Besides, Hj is 
always a closed subspace (even if H, is not closed). 


14* 
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18. Orthogonal Series 


18.1. n-Dimensional Trigonometric Functions 
18.1.1. Orthonormal Systems 


If 2 isa domain in f2,, then, according to Subsec. 17.3.2., L4(Q) is a (complex 
separable) Hilbert space. Theorem 17.3.4 then shows that there exist complete 
orthonormal systems {f/,(x)}j_, in L.(2). These functions f,(x), fo(x), ... span the 
space L,(2), in the sense set out in Subsec. 17.3.5.: they play the role of the co- 
ordinate axes in R, or C,. It is plausible that it will be attempted to construct 
explicit complete orthonormal systems in L.(2). For general domains 2, how- - 
ever, this is hardly possible (at least if well-interpretable, simple systems are 
desired). The situation is improved decisively if, say, the cube 


Q=Q= {a | Hema Wares cases) eee tao lajl<z7 forj—1) 48} (1) 


is chosen asa fundamental domain (Fig. 18.1). Let Z, = {m | m=(m, ..., mn) Rp; 
m; integers} be the lattice of the points of &, with integer co-ordinates. As usual, 
n 


ie 2 Or C(t elt, andl 7 = (10 ee ete 
j=! 


Lemma 1. {(27) eter is an orthonormal system in L(Q). 


Remark 1. This can be proved by simple integration. A less simple problem raised by this 
Jemma is whether this system is complete. 


Remark 2. The system stated in the above lemma can be rewritten as follows. One has 
el?™ — (cos m 424 +1 SIN 2,24) (COS Moy +i SIN Mg%o) ... (COS MyXn +i SIN MX) 
and conversely, cos... and sin... can be represented as linear combinations of e!. As cos 


and sin are even and odd functions, respectively, it will suffice to restrict our considerations 
to Nyz={m | m=(m, .... Mn) ESn, mz =O}. 


vi 
jews (cos cos : 
Lemma 2. In the system {0;,'7 | mars} a { : mcr} wel On 2) 
: sin sin MEN p 


where k 1s the number of zero co-ordinates, m;=0, in m= (mM, ..., M,)EN,. Here 
sin 


being admissible). Lf the identically vanishing terms in the above system are omit- 
fed, one obtains an orthonormal system in L»(Q). 


cos : : eee 
my| means that erther cos m,xr, or sin myx, can be taken (all combinations 
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Remark 8. The somewhat clumsy formulation is explicable by the different values of 


Tv ™ 7 Tv 
J costidi= f sin%tdt=r, f cos 0tdt= i Qa » 
=e = 


7 Tr. — 0 
l=1,2,... Anyway, it is easily shown that Lemma 2 is a reformulation of Lemma 1. 


Remark 4. In the case n =1, i.e., Q=(—7, 7), the systems of Lemma 1 and Lemma 2 are 


Ue ee ot ins ° 
—— oiled and —- = COS MX, —— SIN MH 
|) Bae M=— co | ee ae ss m=1 


These are periodic functions in the interval (—7. x). 


Remark 5. If the domain considered is not the special n-cube Q of (1) but a general »-dimen- 
sional interval (n-cuboid) g={~|«x€R,, |e;—a,;|<b; for j=1,..., n}, then the systems of 
Lemma 1 and Lemma 2 can easily be converted to this case (Fig. 18.2). Then Lemma 1 
reads; 


1 Rn incepta) 

———— : 2 

i= -25 yh ie ( ) 
SIDS) AUG) cron, Sip meLZy 


is an orthonormal system in L(g). Lemma 2 has an analogous formulation. 


Fig. 18.2. 


18.1.2. Fourier Coefficients and Absolute Convergence 


Consider the Hilbert space H = /.(Q) of Subsee. 18.1.1. and the orthonormal 
system of Lemma 18.1.1/1, then 


n n 


m= (fy (28) Fe) = (2m)? fla) eae (1) 


are the corresponding Fourier coefficients in the sense of Def. 17.3.3 (b). Here 
meZ, and f€ Lo(Q). We recall that, by Def. 14.6.4/2, f(x) is called a function with 
bounded support (or finitely nonzero) if supp {/cQ. 
Theorem. Let Q be the n-cube of (18.1.1/1), and let a, be the Fourier coefficients of 
(1). 
(a) Tf fe Lo(@), then 
D> laml2SlVfllicq  (Bessel’s inequality) . (2) 
meZLn 


(b) If f(x) is k times continuously differentiable and a function with bounded sup- 
port on Q, then 


l¢ml lml~‘lbnl where |ml+0 and Dd) lb,l2<. (3) 
mMEZLy 


n 
: 2 
Here k=1, 2, 3, ... and |m|2= >) mz. 
k=1 


(ec) If flr) is k times continuously differentiable and a function with bounded sup- 
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; n 
port on Q, and if k>-> , then 


g(x) = 2 a ce (4) 
Qn] 
converges absolutely and uniformly on Q. In particular, g(x) is continuous on Q. 


Remark. Formula (2) can be directly inferred from Theorem 17.3. 3(a) and Lemma 18.1.1/1. 
The same theorem also implies that the series (4) converges at least in L,(Q). The absolute 
convergence of (4) results from (3): 


1 
= 1 
> l4ml Sl 2 (lem |mel*)2] 2 ( A =) are (5) 
meZy m+0 m+=0 | 

m+0 


1 : : 
because > —— can be essentially estimated by 
| 2k 
m+0 [a 


: Saat 
= (m3 + ee an = 


k 
my = m,=1 Ca mM: 


Thus part (c) can also be reworded: with the assumptions of part (c), the series >) |@m! 
converges. meZy 


18.1.3. Periodic Trigonometrie Series in Lo(Q) 

Theorem. ‘If Q is the n-cube of (18.1.1/1), then the orthonormal systems of Lemma 
18.1.1/1 and Lemma 18.1.1/2 are complete. 

Remark 1. A relatively concise proof of this important theorem has been given in [36]. 


Remark 2. Thus we have found the connection to the gencral theory of Subsee. 17.3. 
Theorem 17.3.3 implies that, for all f€L(Q), 


Cm ee 
= 2 vem in Ly(Q) (1) 
meZy (Qn)2 
and >) lanl? =z, (Parseval’s equation). Here a, are the Fourier coeffients of (18.1.2/1). 


MEZy 
This is the expansion of functions f(x) € £.(Q) in Fourier serics. Accordingly, an expansion 
can be written in sin/cos terms. If the assumptions of Theorem 18.1.2(c¢) are satisfied, then 
g(~) =f(a) in (18.1.2/4), and (1) converges not only in L2(Q) but also absolutely and uniformly. 


18.1.4. Semiperiodic Trigonometrie Series in L,(Q) 
_ The numbers en have the same meaning as in Lemma 18.1.1/2. Further let 
N,=im|m=(my, «., %_)€Z_, m;= 1}, and let 


g={x|e=(xy, «, tn)ER,, O<aj<n for j=1,...,n} (Fig. 18.3). 


n 


ro 3 


7 


We 2 De il 
Theorem. (=) [] sin mea} and {(=) — [|] cos m,2, are complete 
Tt meENy 


p= ™ Omk=1 meENy 
orthonormal systems in Lo(q). 
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Remark 1. For x =1, ¢=(0, =), and the systems have the form 


iD oo 1 DQ co 
| = sin ma aii a | = cos nie (Fig. 18.4). 


m= 


a 


Vx ‘ m=! 


Fig. 18.3 0 = Fig. 18.4 


Hence these systems comprise the semiperiodic functions on the interval (0, x). The number 
of sin and cos functions admitted to competition has now increased (as compared with the 
periodic functions of Subsecs. 18.1.2. and 18.1.3.; note that (0, =) is the fundamental 
mterval now), but it will snffice to consider either the sin or the cos functions alone. 


Remark 2. The theorem is proved by reducing it to Theorem 18.1.3. If f€ L(g), then f is 
continued to the n-cube Q of (18.1.1/1) as an even or odd function, and the function con- 
tinued in this way is expanded in the sense of (18.1.3/1). If this expansion is reduced to 
x€q, one obtains the completeness of the systems referred to in the above theorem. The 
fact that the systems are orthonormal can be verified directly. 


Remark 38. By analogy with Remark 18.1.3/2, it is now possible to expand functions f€ L9(q) 
in terms of the systems of the above theorem. If g is the general n-dimensional interval 
(n-cuboid) of Subsec. 18.1.1., then, by analogy with Remark 18.1.1/5, a corresponding 
modification must be applied. 


18.1.5. Example 


We develop f(r) =r? on the interval (—7z, z) in terms of the (complete ortho- 
normal) second system of Remark 18.1.1/4. The Fourier coefficients are easy to 
calculate explicitly, and one obtains 


—? co ( m 
ae —- 
9 é 


w= he : 
The theory ensures the convergence of the series in Lo(@Q), with Q@=(—7z, z). It is, 
however, immediately seen that the series also converges absolutely and uni- 
formly. Then the series must also converge pointwise to .?. Setting r=7x and 
x=0, one obtains 


cos mx, xr€(—7, 7). (1) 


Se as DN ( Salhiees ee 
> —a=—p and > oot —. 
we me el m2 12 


The first result is already known to us from Theorem 15.9.3/2. It is possible to 
obtain other results of this kind: by twice integrating (1) one obtains a 
= 
representation of x‘, from which a formula for >’ ma Ce be derived. it is 
mat mn 
worth while playing a little with such expansions. 
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18.2. Orthogonal Polynomials 
18.2.1. Approximation Theorems 
lf 6 is the small positive number from the Figs. 18.5 and 18.6, then x; = 


=cos yz, k=1,...,n isa mapping of Q onto q, with derivatives up to an arbitrary 
order. A function f(2x,, ..., 2,) which is & times continuously differentiable is 


6 K-O XK 


Fig. 18.6 


transformed into a function g(yy, ..., Yn) =H(x4(y4), --) Ln(Yn)) With the same pro- 
perty, and vice versa. 


Theorem 1. J f(x) is a function with bounded support on q which is k times contin- 
uously differentiable there, where hoes then for every positive number e there 


exists a polynomial P,(x)= P.(x4, ..., X_) such that sup |f(x)— P,(x)| se. 
req 


Remark 1. The theorem is reduced to Theorem 18.1.2(c): by the mapping x, =cos y;, f(a) 
is transformed into a function on Q, and the new function g(y) is approximated in the same 
way as in Theorem 18.1.2(c). Here, however, one chooses the cos system of Theorem 18.1.4, 
which gives : 
n 
WY)— 2 Om Lf cos meyy | Se 
mlsMo k=1 

with a suitable choice of M and a,,. In view of Moivre’s formulas of Subsec. 5.2.4. one has 
cos my =P(cos y) (polynomial in cosy). By insertion of this polynomial and inverse 
transformation one obtains the above theorem. Thus the theorem is derived from Theorem 
18.1.2(c) by a few simple transformations and modifications. 


Remark 2. Application of simple linear transformations shows that the theorem remains 
true if g is an arbitrary n-dimensional rectangular parallelepiped in 2. Further, by applying 
Sobolev’s mollification method referred to in Subsec. 14.6.4., one observes that the theorem 
is also true for arbitrary #-dimensional rectangular parallelepipeds g and for arbitrary 
functions that are continuous on g. This is Weierstrass’ approximation theorem. 


Theorem 2. /7 27s a boundet domain in Rh, and if lS p~<~@, then the polynomials 
in L,(2) constitute a dense set. 
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Remark 3. The theorem follows from Theorem 14.6.4/2 and Theorem 1. 


Remark 4. Now a complete orthonormal system in Lo(Q) can be constructed using Schmidt’s 
orthogonalization as described in Subsec. 17.3.4. 


18.2.2. Legendre’s Polynomials 


Let us consider the one-dimensional case, with Q=(—1, 1). 


Theorem. (a) Wath a suatable choice of the real constants c,, the Legendre polynomials 
de : 
Ly (x)= cy qa td —S) eh &k=0,1,2,... and =Wersl (1) 


constitute a complete orthonormal system in L4(Q), with Q=(—1, 1). 

(b) If {P.(x)}F-0 ts an orthogonal system of polynomials in L,(Q), with Q= 
=(~—1,1), and if Py(x) has the degree k, then P,(x)=d, L(x), where the L(x) 
are the Legendre polynomials of (1) and d, are suitable numbers. 


Remark 1. The truth of (L;, Li)z,@)=0 for k+l is verified by partial integration. The 
completeness of the system is inferred from Theorem 18.2.1/2. 


Remark 2. Thus every function /€L,(2), with 2=(—1, 1), can be expanded in terms of 
Legendre polynomials: 


{ 
L(x) j Hy) Ley) dy . (2) 


19. Partial Differential Equations 


19.1. ‘Types of Partial Differential Equations and Physical Examples 
tet. Types 


In Chaps. 4 and 10 we considered ordinary differential equations of first and 
higher orders, 
(ae ein ee reece i=) 


Qualitative propositions (Chap. 4) and explicit solutions (Chap. 10) were a pri- 
mary consideration (Fig. 19.1). Now we are looking for functions u(x) = u(r), 5 
x,) (Fig. 19.2) which are solutions of partial differential equations of first and 
higher orders, 

a wi =| =0, F (« a 2 =), 


2 —— 5 he == 5 000 SSE 
oe «Gx, Ox; OL,OX, 
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yix) 


Fig. 19.1 Fig. 19.2 


In this case there is no counterpart to Chap. 10: apart from very special partial 
differential equations it is not possible to state the set of all solutions of a given 
partial differential equation (of higher order) explicitly. There is, however, an 
established qualitative theory. Here it turns out that some essential parts of the 
theory of first-order partial differential equations can be developed largely on the 
model of the theory of ordinary differential equations. Virtually new effects will 
however occur in partial differential equations of higher order. Partial differen- 
tial equations of second order are of special interest here, because many problems 
of physics can be reduced to such kind of differential equations. In this chapter 
we shall discuss the three physically interesting basic types of linear partial 


9 


nr 
differential equations of second order: if x=(a),...,%,)€R,, then A= > A 
jaa 
called Laplace’s differential operator (Laplacian). (x, ¢)€R,,; means that 
= (24, ..., T)ER, and é€ Ry. Thus in particular one has u(z, t)= | cece 
The thrce basic types are 


is 


Mu fa) Laplace- Poisson equation , (1) 

Au— ani. t) wave equation , (2) 
Ou : : 

Au aor =f(x,t) heat conduction equation . (3) 


In (1), u=u(x) depends on x€ FR, alone, whereas in (2) and (3) it depends on 


(x, 4)€ R,,;. In any case, however, the Laplacian applied to u(x, £) is defined as 
Rr C24 


Mo) = > 


ae 
rately in (2) and (3). It turns out that a great number of physically interesting 
linear partial differential equations of second order can (at least qualitatively) be 
reduced to these three basic types. Their treatment thus gives a representative 
insight into the theory of linear partial differential equations of second order. 


that is, the derivatives with respect to ¢ are written sepa- 


19.1.2, Physieal Examples 


The vibrating string (Fig. 19.3): Consider a string that extends over the inter- 
val [0, /] of the x-axis, being fixed at its end points x=0 and r=l. If the string 
is deflected and then allowed to move, it will vibrate. When the string is let 
loose, it can be given a push to realize an initial velocity in this way. If u(c, t) 


3 


with 0=x=/ and ¢=0, is the deflection of the string from the normal position at 


ut 
{ 
1 


mee 
0 l Fig. 19.3 


Ores 19.2. The Laplace- Poisson Equation 8h 
lah Cle 


the time ¢ and the position x, then for small deflections the vibration process is 
described by the wave equation (19.1.1/2), with f(x, 1)=0 (for normalized matter 
constants). Altogether this leads to the following problem: desired a solution of 
the differential equation 


oe 
of _ =0 for 0=x=/ and t=0, where 
Ou sone ne 
u(x, 0)=ug(x) and eva (x, 0)=u,(x) (initial conditions) , 


u(O, 2)=u(l, t)=0 (boundary conditions) . 
Here uo(x) (initial deflection) and w,(z) (initial velocity) are given functions. 


Heat conduction: Consider a bounded body 2 in Ry, with a sniooth boundary 
e2. Suppose that at the time t=0 the body exhibits the temperature distribu- 
tion w(x), x€ 2. Further suppose that on the surface 62 of the body a time-inde- 
pendent temperature distribution ¢(y), y€éQ, is given. Find the temperature 
distribution u(x, é) at the time ¢ and the position x€ 2 in the interior of the body. 
For normalized matter constants this leads to the solution of the problem 

u(x, ¢) 


a 


Au(x, t)- =O tor 262 sand 7=07 where 


He Vwi tor 962 (nitial condition), 
u(y,t)=o(y) for yeeQ and ¢t=0 (boundary condition) . 


Stationary distributions: In the problem of heat conduction just investigated, 
consider the case that to: what will be the temperature reached in the interior 
of the body 2 after a long time? It will be presumed that the initial temperature 
distribution will have no influence on this final temperature, whereas the (time- 
independent) temperature distribution on the surface will influence it. The final 
temperature distribution u(x) does not depend on ¢. If (rather daringly) u(x, t)= 
= u(x) is inserted into the above equation, then one has a plausible argument for 
the following statement: the temperature distribution u(z) in the interior of the 
body 2 after a long time, with the time-independent temperature distribution 
¢(y) being given on the surface 02, is the solution of the problem 


Au(x)=0 for 760), 
u(y) =o(y) for yEeeQ (boundary condition) . 


Remark. Thus we have three typical physical problems which lead to three typical problems 
for the three basic types. Physics gives indications as to what kind of mathematical prob- 
lems might be fruitful. 


19.2. The Laplace-Poisson Equation 
19.2.1. Fundamental Solutions and Integral Representations 


Singularity solutions: In £,\{0}, we are looking for twice continuously differen- 
tiable functions u(x) =u(z4, ..., 7,) with Au(r)=0 in F\{0}, which depend on 


r=YVri+...+22 alone, that is, u(y, ..., ¢,)=v(r). If one introduces Au(x) =0 into 
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: 2 : : . . dv n—-1dy 
this setup, one obtains the ordinary differential equation Slee ene dra 
ey 
Besides the solution v(r)=c (which is of no interest here) one obtains 
1 
vo7)=—— for n=>2 and ov(r)=Inr for n=2 (1) 
pd 


(apart from a constant factor). Here we always assume that n= 2. 


Notation: In this subsection we consider connected standard domains 2 in RF, 
in the sense of Subsec. 9.3.1. (Fig. 19.4). In particular 2 is then bounded, the 
integral theorems of Sec. 9.3. are applicable and it makes sense to speak of the 
exterior normal » with respect to the boundary @2 of Q (except for some corners 
and edges). If on an arbitrary domain 2 (which need not necessarily be a stand- 
ard domain) the real-valued function f(x) has all its partial derivatives up to 
and including the order f, and if all of these derivatives are continuous on Q, then 
we write /€C*(Q). If, in addition, all these partial derivatives can be continued by 
continuity to Q, then we write f€O0*(Q). Here k=0, 1, 2, ..., and f(x) is identified 
with the zeroth derivative. Instead of C°(2) or C(Q) we also write O(2) or 
C(Q), respectively. 


GQ 
, 
> 
v Fig. 19.4 
Definition. Let 2 be a connected standard domain in R,, and let x®°€ 2. Then 
| e 2 
_ S(n— 2) log boa (2) jor a= 2 


Y20(x) = 1 
——In|xr—2°]4+O(x) for n=2 
27 


ts called a fundamental solution if ®€C2(Q) and A®(x)=0 on Q. 


Remark 1. |@,| is the volume of the unit sphere in #, as defined in Subsec. 9.2.6. 


Remark 2. (1) are solutions of the Laplace equation Au(x) =0. Then cu(a« —2®) are solutions, 
too. Thus one has Ay, o(x)=0 on \{x°}. Hence the fundamental solution is a normalized 
singularity solution, where @(a) is still artibrary. 


Theorem. Let 2 be « connected standard domain in R,. Further let u€ CQ) and 
Au(x) = f(x) for r€Q. Tf x®€Q, then 


ran Oy x0 . 
Ok @al) Vee f [re Sea 2 ds — i Ge) hen ese | (3) 


op 
a2 


Remark 8. Here y,0() is an arbitrary fundamental solution as per the above definition. 
The first integral is a surface integral in the sense of Subsec. 9.2.5. 


Remark 4. (3) is proved by applying the second Green formula as stated in Subsec. 9.3.2., 
where the notation 2 used there is replaced by 2\K,, and g and f are replaced by u and 0, 
respectively (Fig. 19.5). Here K, is a small ball of radius ¢ about 2°. Some estimates 
lead to the desired result for ¢}0. So the formula is less intricate than it looks like. 
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Fig. 19.5 


19.2.2. Green Funetions 
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Definition 1. Let Q be a connected standard domain in R,,. A real-valued function 


u(x) ts called harmonte on 2 if u€C2(Q2) and Au(x)=0 for x€EQ. 


Remark 1. There exist functions u(z) which are not continuous at all points of Q, but 
nevertheless satisfy the equation Au(x)=0 for all x€Q. According to the definition, such 


functions are not harmonic. 


Definition 2. [f Q is a connected standard domain in R,, then g(x®, x) is called 
Green function (of the first kind) af, for the fixed x® €Q, the function g(x, x) is a fun- 


damental solution in the sense of Def. 19.2.1 and g(x, y) =90 for all yeoQ. 


Remark 2. Every z°¢€Q is admitted to competition, and 9(#%, x) is regarded as a function 
of x on 2, A comparison with Def. 19.2.1 shows the problem that arises here: are there 


any harmonic functions ® such that y,yo(7) of (19.2.1/2) becomes a Green function? 


Remark 3. (19.2.1/3) takes a simpler form if y20(z) =g(x®, x) is a Green function. With the 


assumptions of Theorem 19.2.1 one has 


if) 
wat) — fu OP doy folae, wile) de. 
2 


ov 
62 
Tf u(x) €C2(Q) is harmonic on Q, then one obtains 


09(x°, 
u(x) =— | uly) ne dsy . 
d2 


This is an interesting formula we shall make use of later on. 


Theorem. Let Q=Kp={x| lal< RK}. 
(a) If n>2 and 2°€ Kp, then 


; 1 1 ree 1 oe 
LEIGH Vogl lear? VA [Re pay Or Fh 
x—-—, x? 

{ar0|2 


il 1 1 
10.2) — yal EP 
7s a Green function on K zg. 
(b) Lf u(x) €C2(K z) is harmonic on Kg, if n 22, and Uf CK p, then 
2 |al2 uly) 


SS ae . 
MeO Raia yf OP 
R 


2 


(4) 


R ose Neh 
Remark 4. The point y9=x° —- is the mirror image of #®¢ Ay with respect to the sphere 


|9|2 


(ef. Subsec. 15.8.6; Fig. 19.6). In particular, y® is exterior to Kp. Then it is clear that 
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g(x°, «) is a fundamental solution in the sense of Def. 19.2.1. It remains to show that 
g(x, x) =0 for |a| = R. One obtains (4) from (2) by insertion of g(x®, x). (4) is also valid for 
n= 2, whereas g(x, 2) has to be modified for n=2. 


---@ 
Vv? 
Fig. 19.6 
Remark 5. By insertion of u(x) =1 into (4) one obtains, for |2®|< R and n=2, 3, ..., 
R2— |220|2 dsy 
— Rlw,| ly — 29)” ° 
OK R 


(5) 


Marginal note of philosophical nature: if you should have forgotten the magnitude of 1, 
then you can evaluate the integral in (5) numerically, which will give the desired value (at 
least approximately). 


Remark 6. For general standard domains it is not possible to state the Green function explic- 
itly. It is however possible to make the following important statement (which is equally 
important for physics and mathematics): if 2 is a connected standard domain in R, and 
x°€Q, then there exists a unique Green function on 2 with 2° as singular point. 


19.2.3. Properties of Harmonie Functions 


If 2 is a bounded domain in R,, then u(x)€C(Q) is said to have the mean 
value property on 2 if, for every point x°¢€ 2 and every ball K = {x | |x—.x®|<e} 
with AcQ, i 


1 
elles aR] few) ds. 
aK 


Here 0K is the boundary of A and |éK| is its volume (see also Subsec. 9.2.6.). 


Theorem. Let 2 be a bounded domain in Ry, and let u(x) €C(Q) be harmonic on Q. 
Then 

(a) u(x) has the mean value property on Q. 

(b) u(x) as differentiable on 2 up to arbitrary order. 

(¢) (Maximum-minimum principle): u(x) assumes its maximum and its mini- 
mum on @Q. 


¢ 


Remark 1. All the propositions of the theorem are rather simple implications of the repre- 
sentation formula (19.2.2/4). 


Remark 2. For n=2 there is a relationship with holomorphic functions; cf. Subsecs. 15.1.4. 
and 15.3.4. 


Remark 3. Part (a) of the theorem characterizes harmonic functions: u(x) €C(Q) has the 
mean value property if and only if w(a) is harmonic on Q. 


Remark 4. By analogy with Subsees. 15.3.4. and 15.3.5., the following statements can be 

made: (a) A function that is harmonic and bounded on R,, is constant. (b) If Q is a connected 

bounded domain in R,, if u(a)€C(Q) is harmonic on Q, and if u(x°)=max u(y) for some 
: yea 

point #9 €Q, then u(x) is constant on Q. An analogous proposition holds for the minimum. 
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19.2.4. Dirichlet’s Boundary Value Problem 


Definition. (Dirichlet’s boundary value problem). Let Q be a connected standard do- 
matin tn R,, and let p(y) be a continuous function on the boundary 62. Desired a 
function u(x)€O(Q) such that u(y) =¢(y) for yE0Q, which is harmonic on Q. 


Theorem 1. Dirichlet’s boundary value problem haus at most one solution. 


Remark 1. The theorem is a simple implication of the maximum-minimum principle stated 
in Theorem 19.2.3(c). 


Theorem 2. /f Q=Ap={r | |xl< Rj, and if vy) is continuous on OK p, then 


aie) 
CCG = all eee (1) 
OK p 


as the unique solution of Dirichlet’s bou ndary value problem for the ball K p. 


Remark 2. If there is a solution at all of Dirichlet’s boundary value problem for Q= K p, 
then it must necessarily be of the form (1). This follows from (19.2.2/4). One must, however, 
check explicitly whether (1) is indeed a solution. 


Remark 3. It can be shown that Dirichlet’s boundary value problem as defined above 
(i.e., for arbitrary connected standard domains) has exactly one solution. The existence 
theorem is, however, rather complicated; see for instance [48]. 


Remark 4 (stability). Let p,(y) and @(y) be continuous functions on 9Q, and let u,(x) and 
w(x) be the associated solutions of Dirichlet’s boundary value problem in the sense of 
the above definition. Then. 


2) ne i) 934) (2) 
This expresses stability: small changes of the boundary values will also cause only small 
changes in the solutions. (2) is an implication of Theorem 19.2.3(¢). 


Remark 5 (physical interpretation). The physical interpretation of Dirichlet’s boundary 
value problem has been given in Subsec. 19.1.2. (stationary distributions) for the case 
BSS. 


19.2.5. The Poisson Equation 


Usually Au(x)=0 is called the Laplace equation and Au(x) = f(x) is called the 
Poisson equation (or inhomogeneous Laplace equation). 
Theorem 1. Let f(x)¢€C?(R,) be « function with compact support in Ry. Lf 
1 


las (n=2) Toral 


(= dy (Newton’s potential) , (1) 
ce 

n 

with «ek, and n=3, then u(x) €C(R,), and Au(x) = f(x) for re R,. 


Remark J. If we are unscrupulous enough to differentiate 


il {(x —z) 
=o) rail ep 
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under the sign of integration, then by Theorem 19.2.1 we obtain 


i ae [ Coe demi 


(n —2) |@n| [eis 
n 


This is the desired proposition, where however the differentiability properties of u(x) must 
be carefully proved. 


Remark 2. The theorem is also true for {€C'(R,) (provided that f has a compact support in 
R,), but the proof is more complicated. 


Theorem 2. Let Q=Kp={xr| xER,, |e] =< R}, and let n=3. If p(y) ts continuous 
on OB py, and tf f(x) €C2(R,,) with a compact support in R,,, then Dirrchlet’s boundary 
value problem for the Poisson equation 


Awa\=f(e) jor w@ekp,  ulsjecoe 
uly) =9(y) for yeeKp, u(x)eC(Q), 
has a unique solution. 
Remark 8. It is rather easy to reduce this theorem to Theorem 1 and Theorem 19.2.4/2. 


Remark 4. The theorcm remains true if 2 is a connected standard domain, p(y) is continuous 
on 9Q, and {/€O'Q). But the proof is then more complicated; cf. Remark 2 and Remark 
19.2.4/3. 


19.3. The Wave Equation 
19.3.1. Uniqueness Theorems 
Hi = (Gay oy t,) © and fe, then wersetiG ic 72,0,. ict 
=a ae) eet Olen lea yes 


Then Ri.= (a, a, eR, (=O). 


Definition 1 (¢nitial value problem or Cauchy's problem). Let uo(x)eC(h,), 
u(x) EC(R,), and let f(x, t)€C( RZ, |). Desired a function u(x, t)€ C2( Rx.) such that 


07u 
Aule, )- Se (es N= He, 1 for (a, NER, 
u(x, )EC(Rt1) and u(x, 0) =u9(z) jor 2G, , 
Ou 


ou 
By eC i mand op (v2 8) = tale) ee vee 


Remark 1. The spaces C#(R,,) cte. have the same meaning as in Subsec. 19.2.1. According 


nr ly 
to the convention made in Subsec. 19.1.1., Au(a, t) = > — > (2, t), that is, A always applies 
to the space co-ordinates a, ..., %, alone. kai Oty 


Theorem 1. The initial value problem as per Definition 1 has at most one solution 
(in the class of sufficiently smooth functions u(x, t)). 


Remark 2. The proof is based on the derivation of an integral identity for sufficiently smooth 
functions u(x,t). One considers a circular cone A whose base Dp is a ball in the plane 
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eee eee eee tet 


; T 
é=0 and whose lateral surface includes an angle of z with the é-axis (Fig. 19.7). Let D, be 
the surface of intersection with the plane ¢=const, as indicated in the figure, and let 8, be 
the lateral surface between Dy and D;. Further let » be the normal vector of S; (:hen one 


has cos (y, #)= . Finally, let A, be the truncated cone between Dy and D,. Then, for 


il 
12 


sufficiently high-order differentiability properties, the following identity holds: 


F aa, Ju du 
> Ga-8 ‘\i ae JMG ck We 2 (Ca) |e o 
t 


2 ou Ou Pl rn / du Ou 2 
ei (3) -- S (sz) | da + 4 BOSE) i (= cos (y, é) — 5, 008 (, ~)) Jas : 
t St 


02u 
To prove the theorem, one can assume that ap — —Au=0 and u(z, =" = (2, 0)=0. Then 
the left-hand side of (1) is zero and the right-hand side is =0. This then nee that u(x, f)= 
=0. The cone considered above plays a fundamental role in the theory of the wave equation. 


Fig. 19.7 


Definition 2 (boundary and initial value problem). Let Q be a connected standard 
domain in R,, and let Z=Qx (0, <) be the cylinder over Q, as shown in Fig. 19.8. 
Let uo(r)€C(Q), uy(x)€C(Q), and let f(x, t)€C(Z). Further let y(y) be a function 
continuous on 02. Desired a function u(x, t)€C2(Z) such that 


> J=f(a, t) for G; Hiea. 
u(z,t)€O(Z) and u(x, 0)=u9(x) for xEQ, 


Au(x, t)— 


a a 


a 4) 
- (x, )€0(Z) and m u(x, O)=uj(x) for «EQ as well as 


uly,t)=¢@(y) for yea2 and t=O. 


Fig. 19.8 
Theorem 2. The boundary and initial value problem as per Definition 2 has at most 
one solution (in the class of sufficiently smooth functions). 

Remark 3. The kind of the boundary and initial value realization leads to difficult problems 


we shall not discuss here. This also accounts for the vague formulations of Theorems 1 and 2. 
For a more profound discussion on the basis of gencralized solutions in Sobolev spaces, the 


reader is referred to [66]. 
15 Triebel, Math. Physics 


198 19. Partial Differential Equations Gore. 
19.3.2. The Wave Equation in One Dimension 


Now let n=1. In other words, we consider the (homogeneous) wave equation 
Cu ou 
a ape where u=u(2, ¢é) and x€ A, t€ Ry. 

Ou Ou 
éx2 At? 
u(x, t)=v(a+é)+w(a—t), where vEC(R,) and weC2(h,). al) 


Here xER, and te Ry. 


Theorem 1. w(x, t)€C2( Ry) ts a solution of if and only tf 


Remark 1. It is immediately verified that (1) is a solution. To show the converse is not 
difficult either. Tf v(x) €C7(R,), then, as t+, v(z+t) is a wave travelling to the left, and 
v(a—t) is a wave travelling to the right (Fig. 19.9). Thus (1) is a superposition of two waves 
travelling to the right and left, respectively. 


v (x+t) v(x) v(x-t) * Fig. 19.9 


Theorem 2 (vibrating string of infinite length). If up(x)€C2( Ry) and u(x) ECR), 
then 


1 1 
ule, == (gle +t) + 2(@—1)) +5 . HAG Ce, EP, (20. (2) 
in 4 Zt 
is the unique solution of the initial value problem 
Ou Ou 
x2 at?” 
in the sense of Def. 19.3.1/1. 


uz, O)=uglx) and = Oe) . (3) 


Remark 2. u(a, ¢) has the structure stated in Theorem 1. Further it is easily seen that the 
initial values are correctly assumed. 


Remark 3 (physical interpretation). In the sense of Subsec. 19.1.2. it is possible to interpret 
(3) as the vibration of an infinitely long string. Then wo(#) is the initial deflection and 
u,(x) is the initial velocity. The travelling waves referred to in Remark 1 now have a real 
physical meaning. 


Remark + (domain of dependence). In connection with the physical interpretation given 
in Remark 1, the following statement is of interest, too. From (2) it is seen that for calcu- 
lating the value of w(x, t) at the point (x,t), with w€ #, and t>0, it suffices to know the 
values of uo(y) and w,(y) in the interval [#—#, x+¢] (domain of dependence; Fig. 19.10). 
Tf up(y) and u,(y) are zero outside the interval /, then the effect of the perturbation intro- 
duced by the values of uo(y) and xw,4(y) in J becomes perceptible at the point x only after a 
finite time, that is, when [a —#, +t) NL+9. 


Fig. 19.10 
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a ig 


Theorem 3 (fixed-ended string). Let Q=(0, 1), where 10. If ug(a) €C2(Q), u(x) € 


ECL), and if ug(0) = uo(Z) = 4 4(0) = uj (1) = ug" (0) = ui! (2) =O, then the boundary and 
rnitial value problem 


O2u 02 u 
ee for x€Q and t=0, 
Ou 
u(x, 0) = u(x), ay O)=u,(r) for xéeO, 


On =a iO) for 7 =0 , 
has a unique solution u(x, t)€C(Z), where Z = (0, l) x (0, ~). 
Remark 5. The solution u(x, t) can be constructed as follow 8: u(x) and u,(x~) are extended 
to the interval (—/. 0) as odd functions. Then the functions obtained in this way are ex- 
tended to become periodic functions, with the period 21, on the whole R,. If the result is 


again denoted by uo(x) and u(x), then uy€C%R,) and w,€C'( Ry). Now formula (2). with 
O<a =/ and t=0, represents the desired solution. 


Remark 6. The physical interpretation has been described in Subsec. 19.1.2. 


19.3.3. Initial Value Problems for the Wave Equation in Two and 
Three Dimensions 


Spherical mean values in Rg. If u(x)€C( Rs), then for ‘+0 we set 
1 it : 
(M(t) w) ae f wet) res | u(y) ds, . (1) 
o i =a Sse 
Here is the unit sphere in #3. Further let dy be surface element on w, and let 
v€w (Fig. 19.11). The second integral is taken over the sphere of radius ¢ centered 
at x, where ds, is the surface element. (Surface integrals have been discussed in 


Subsec. 9.2.5.) It is easy to see that the two integrals in (1) are equal. Since 
47/2 is the surface area of a sphere of radius ¢ in Rs, (1) represents a mean value. 


Fig. 19.11 
Theorem 1. Jf wo(x) €C3( Rs) and u,(r)€C2(R), then 


a 
u(x, t) = M(t) w) (@) +5, 


7s the unique solution of the initial value problem 


[¢(M(é) uo) (x)], where xR, and t>O0 = (2) 


at 


bua for xE€h, and t>0, 


Ua Oana), 


Ou 


a O)=u,(r) for «ERs, with u(x, t)eC2( RT). 


15* 
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Remark. It is rather troublesome to prove this theorem, and especially to show that u(z, ¢) 


is a solution of the wave equation. u€C%R7) is to be understood as saying that u as well 


as its first- and second-order partial derivatives are continuously extendable from RF to RF. 


The two-dimensional wave equation. The two-dimensional wave equation is 
treated using the method of reduction of dimensions. If wo(ary, x2) and w(x, x2) 
are the given initial data, then one adds a third dimension: uo(x) = uo(%), £2) and 
u,(%) =U (a1, 2), where r=(x1, 7, ty). It is not difficult to see that then the 
three-dimensional solution u(x, f) is also independent of x. The problem then is 
to convert (1) and (2) into two-dimensional expressions. The two-dimensional 
counterpart to (1) is 


— 1 Cie : 
(Hi) (ey) =5- f —— ee hy, dy, (3) 
US eS Cee Uh —Iy =Up 


where the integral is taken over the disk A = {(y,, yo) | (v7 — yy)? + (42-9)? <P} 
with centre (2), r2) and radius ¢. 


Theorem 2. If u(x, 2) €C3( Ro) and uj(ay, ry) €C2(Ry), then 


= joe ; 
Uy 2) = (Marty) (1 2) +5 (Marte) (tp 29)] with (wy m2) € Ro 


and (#0 


is the unique solution of the initial value problem 
Ou du au 
—st+—5=—5 for (a %)€R, and t=0 : 


ou 
Wei) tig (yo), oe (Gita, O) = to) fC, ae 


where w(x, x, t)ECX RS) . 


19.3.4. Physical Interpretations, Huyghenian Property, Spherical Waves 


Vibrating membrane: By analogy with the infinitely long vibrating string dis- 
cussed in Subsec. 19.3.2., one may consider a membrane of infinite extension 
which, in its rest position, extends over the whole A, plane. If the membrane is 
locally deflected at the time ¢=0, then wo(x1, x) and w(x, x2) are assumed to be 
the initial deflection and the initial velocity, respectively. The (small) vibrations 
of this membrane are then described (for normalized matter constants) by the 
initial value problem for the two-dimensional wave equation as stated in Theorem 
19.3.3/2, where w(24, 9, t) is the deflection at the position (xj, x )€ Ry at the time 
t=0. By analogy with Remark 19.3.2/4 it is possible to determine the domain of 
dependence: the value of u(2, x2, ) depends only on the values of w(x, xy) and 
u (ry, %) IN {(Yp Ye) | (ey — yy)? + (% — Yo)? SL}. Tf up and u, vanish outside the 
perturbation indicated in Fig. 19.12, then this perturbation comes into effect not 
until the cones indicated in the figure touch the domain of perturbation for the 
first time. 


19.3.4. 19.3. The Wave Equation 2 
a a ed 


Za.~ 
perturbation 


Fig. 19.12 


Propagation of sound: Sound waves propagate im the f, as pressure waves. 
If u(x, ¢) 1s the deviation from the (constant) normal pressure at the position 
x€R, and at the time 120, then u(a, ¢) satisfies the three-dimensional wave 


oO) 


: Cu 
equation Au= ap If one now assumes that at the time /=0 a (local) pressure 


fluctuation wo(x) and an initial variation u,(x) of this pressure fluctuation are 
generated, then u(r, f) is the solution of the initial value problem for the three- 
dimensional wave equation as stated in Theorem 19.3.3/1. The domain of 
dependence now exhibits a remarkable peculiarity: for calculating w at the 
point (x, ?) one needs nothing but the knowledge of uo(x) and w,(z) in a small 
neighbourhood of the sphere {y| y€ Rs, ly—a2|=2}. This follows from Theorem 
19.3.3/1 and from (19.3.3/1). Now we regard up and u, as a local perturbation, 
e.g. an explosion, assuming that ws and u, vanish outside the perturbation 
indicated in Fig. 19.13. An observer at the point xr€ Rs will perceive the explo- 
sion only after a finite time. At the time ¢, the sound will set in suddenly, and it 
will end suddenly at the time ¢, (Fig. 19.13). If the domain of dependence has the 
form just described, then the differential equation is said to have the Huyghen- 
ian property. It turns out that the wave equations in 1, 2, 4, 6, 8, ... dimensions 
do not have the Huyghenian property, whereas the wave equations in 3, 5, 7,... 
dimensions have it. What a mercy we live in an odd-dimensional space (apart 
from the &,, which does not offer any comfort anyway)! 


perturbation Fig. 19.13 


Spherical waves: Spherical sound waves are of interest in connection with 


explosions. If one introduces the setup u(z, t)=v(r, 4), with x¢ Rs, 120, r= 
2 


os O2 
=Vxr+x5+2r3, into the three-dimensional wave equation Au=Te , then 
: o2 2 
one obtains the one-dimensional wave equation — (rv) =-—- (rv) for rv(r, 0). 


a4) 
Theorem 19.3.2/1 then shows that y at 


fad) 


1 ii 
Uz, ee vy(r +t) + i v»(r—1). 


1 : 1 P 
—v,(r+t) describes an incoming spherical wave, and — v,(¢t—r) describes a 
‘a J 


propagating spherical wave. 
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19.3.5. The Inhomogeneous Wave Equation, Retarded Potentials 


In this subsection we shall confine ourselves to the case n = 3, i.e., to the three- 
dimensional wave ie If f(a, t)€C2( RF), then 


Ep eo=p f May, (1 


Te west 
where x€ Rs, t>0, is called retarded potential. If «¢ A, and ¢>0 are e given, then 
in (1) only ‘those values of f(y, t) are required which lie on the lateral surface 
Olsiitescone (see Hie 19.14). 


(x,t) 


Fig. 19.14 


Theorem 1. Jf f(x, 0) )ECXRT), then u(x, t)=(Rf\(x, t), with ce Ry and t=0, is 
the unique solution of the initial value problem 


2 
a — Au f(z,t) for xé€h, and t=0, 


ou 
at 


Remark. Thus the retarded potential isa solution of the inhomogeneous wave equation with 


u(x, O}=— (z,0)=0 for xe Rs, where u(z, PeC2 (Raa 


vanishing initial data. ucOXR?) has again the same meaning as in Remark 19.3.3. If we 
low Combine this theorem with Theorem 19.3.3/1, then we obtain the following result. 


Theorem 2. If f(x, t)€C2( Ri), and cf up(x) € C3( Rs) and u,(x) € C2( Rs), then 
a 
w(x, t) = (Bf) (x, t) + (CM (thuy)(2) + = [eM (Quo)(x)], ee Bs, t=O , 


is the unique solution of the initial value problem 
C2u 
an 


u(x, 0) = Ug(x), 


—Au=f(x,t) for xe€h,; and t=0, 


& gee 
By O)=u,(x) for xé Rs, where u(x, t)€C2( RZ) . 


19.4. The Heat Conduction Equation 


19.4.1. The Singularity Solution 


Again leh e=(2), 1 %,)eh,, (en, and (teh eurtier let iia) lave 
the same meaning as in Subsec. 19.3.1. 
n _ la? 


Lemma. s(2,f)=(4nt) 2e “ ws a solution of the heat conduction equation 


F 
== As in Rt, . 
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Remark. For the heat conduction equation, s(z, t) plays the same role as r~"*? in Subsec. 
m Ao 
19.2.1. for the Laplace equation. We recall that As= > —,. Fora fixed t, s(a, t) describes a 
k=l OLE 
bell-shaped surface (error distribution curve for n=1: Fig. 19.15). 


5 


R, : 
Fig. 19.15 


19.4.2. The Maximum-Minimum Principle 

Theorem. Let 2 be a bounded domain in R,, with the boundary 6Q (Fig. 19.16). 

Further lee Z=2x(0, T| with T=0 be the cylinder over Q (including the top 
> 7] 

surface). If u(x, t)€ OZ), u(x, t)EC(Z), and tf = =AuonZ, then u(x, t) assumes 


its maximum and its minimum either on the bottom surface Q or on the lateral 
surface 0020, T']. 


Remark. This is the counterpart to the maximum-minimum principle for the Laplace 
equation as stated in Theorem 19.2.3(c). 


Fig. 19.16 


19.4.3. The Initial Value Problem 


Definition. (¢nitcal value problem or Cauchy problem). If y(x)eC(R,), then a 
function u(x, t)€ C2 Rz,,) ts destred, such that 


7) 

sy (t= (Aula, £) for (x, ERI, 

7) 

Moe NeC he wwand saulr Oj=plx) for xé li, . 
Remark 1. This is the counterpart to Def. 19.3.1/1. 


Theorem. /f o(x)€C(R,) ts a bounded function, then in Ry ,, there exists a unique 
bounded solution of the initial value problem in the sense of the above definition. 
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This solution ts 
me _ lea 
ua, j= (4ri) * fe © ofy)dy for zeR, and Ue (1) 
Tes 


WG O) na) on arena 


Remark 2. (1) is called Poisson’s integral, being obtained from y(y) and the singularity 


solution referred to in Subsec. 19.4.1. If the requirement that u(x, t) be bounded in A 
is dropped, then the uniqueness is no longer ensured. 


Remark 3. This theorem as well as Theorem 19.4.2 show that the heat conduction equation 
has some properties resembling those of the Laplace equation, and other properties that 
are analogous to those of the wave equation. 


Remark + (physical interpretation). Consider an infinitely long rod (e.g., of iron), whose 
lateral dimensions are neglected, the rod being identified with R, (Fig. 19.17). If g(z), 
x€R,, is the heat distribution in this rod at the time t=0, then (1) describes the heat 


a2 
distribution at the time ¢>0. For example, if g(a) =——— e 4, where eis a small positive 
number (heat source at x=0), then & 
a2 
TAGE, (i) = ———@ 4(6+1) 
Var (t+) 


for ¢>0 (singularity solution according to Subsec. 19.4.1.). It is seen how the heat is dissi- 
pated. 


Ry Fig. 19.17 


Remark 5. The physical interpretation of (1) is not fully satisfactory: if @(y) is a local 
perturbation (e.g., like that described in Remark 19.3.2/4), then this perturbation comes 
into effect at all positions «€ R, immediately. This contradicts physical principles, according 
to which information propagates with finite velocity. The corresponding remarks for the 
wave equation have shown that in this case perturbations propagate with finite velocity. 
Nevertheless the heat conduction equation gives a satisfactory description of a great 
number of physical effects (heat conduction, diffusion etc.). : 


19.5. Separation Setups 
19.5.1. Introductory Note 


Separation setups for boundary value problems as well as for boundary and 
initial value problems of partial differential equations are of fundamental 
importance for mathematics and physics. Frequently they allow informative 
physical interpretations. Since the methods are constructive, they are also 
of numerical interest. They can frequently be used to caleulate solutions of 
physical problems by numerical approximation. On the other hand a rigorous 
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mathematical foundation of these methods is rather complicated. Within 
classical mathematics this foundation is governed by many complicated con- 
straints. We refer, for example, to [48]. The theory of Sobolev spaces and of 
generalized solutions offers a natural access to these problems; see for instance 
[66]. We shall not, however, develop this theory here. In the following subsec- 
tions we shall present the fundamental ideas, where we shall frequently adopt 
a formal procedure, dispensing with the precise formulation of assumptions ete. 


19.5.2. The Fixed Loaded Die 


In the x4,%-plane we consider the square elastic plate Q = {x | r€ Ry, O< Fa, 
O<.x,<7}, which is fixed at its edges (Fig. 19.18). A load f(r) ¢Z,(Q) deflects 
the plate. If u(r) is the deflection from the normal position, r¢€Q, then u(x) 
turns out to be the solution of the Dirichlet boundary value problem 


Au(r)=f(x) for xr€Q and u(y)=0 for yea. (1) 


ol" "x — -% Fig. 19.18 


As f(x) ¢€L,(Q) (a very general assumption), by Theorem 18.1.4 it is possible to 
expand f(x) in terms of semiperiodic sine functions, 
fx)= DB Gam sinna,sin mr, DP, ldnml?={lAlZ.q - (2) 
nm=1 nym=1 


Using the setup 
So : 
u(r) = — 5 Sin nx,* Sin May , (3) 
nm=a=i N= + mM 


for u(x), one obtains by (formal) differentiation 


oo 


(Au) (x)= -— > = 58 (sin nay * SIN M2) 
RM = 
(4) 
= 2 _ wm sin nx, * sin mx=f(x) , 
nm= 
Os torey cok: (5) 


Hence (3) is the desired (unique) solution. The elegance of the method is 
clear, but it is even clearer that the foundation is imperfect. A satisfactory 
explanation of (4) and (5) can be given within the theory of Sobolev spaces. 
Then (5) has to be understood in the sense of the boundary values for functions 
in Sobolev spaces. For details we refer to [66]. w(x) in (3) is called a generalized 
solution of (1). If f(x) is sufficiently smooth (for instance if f(x) is differentiable 
on Q up to arbitrary order and has a compact support there), then u(r) is a 
classical solution and (5) holds in the usual sense. One may also formulate the 
following statement. If f(x) ¢ L,(Q), then the above method is permissible, nuiner- 
ically practicable, and (within a new theory) rigorously justifiable. 
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19.5.3. The Separation Setup for the Laplace Equation 


We now generalize the problem stated in Subsec. 19.5.2. to arbitrary connect- 
ed standard domains 2 in R&,. (Standard domains have been described in 
Subsec. 9.3.1. They are always bounded.) Thus, given f(a) € L(2), and desired 
a function u(x) such that 


Aule)=f(i7) for x€Q@ and uiy)=0 for ye. ay 


First of all one will ask for an analogue to the (complete orthonormal) system 
of semiperiodic sine functions of Subsec. 19.5.2. A real-valued function v(x) 
is called a (normalized) eigenfunction of the operator A (with zero boundary 
values) if there exists a real number 4 such that 


Avigy=s0@) for wel, (2) 
lvIz.~2)=1 and v(y)=0 for yEeeQ. (3) 


It is immediately seen that for 2=Q the sine functions of Subsec. 19.5.2. are 
eigenfunctions. 4 is called eigenvalue. If Q is an arbitrary connected standard 
domain in F,, then the following statement is true: 


There exists an orthonormal system {v,(x)}21 of eigenfunctions in the sense of 
(2), (3) that is complete in [y(Q). 


If 4; are the corresponding eigenvalues, i.e., if Av; =A,v;, then 0O>1,2=A)2=4,=... 
+-—co as j>co, Here Av; as well as »; lag =0 have to be understood in the sense 
of the theory of Sobolev spaces, which shall not be discussed here (cf. [66]). If 
the boundary of 2 is sufficiently smooth (for example, if it is differentiable up 
to arbitrary order), then v;(x)€C2(Q2), and Av; as well as v; | @2=0 are to be 
understood in the classical sense. But this allows us to transfer the method of 
Subsec. 19.5.2. to the present, more general case. f(x) €D,(Q) is expanded in 
terms of the complete orthonormal system {v,(x)}7_; (cf. Theorem 17.3.3), 


Ka)= 2, are(2) for re, Z lax? =lAlzca - 
i= =I 


Using the setup 


for u(x), one obtains by formal differentiation 
AQ = Noes D>, Ge,(2)=flx) for xe 
: k=1 


and u(y)=0 for yEéQ2. Hence u(x) is the (unique) solution of (1). Again the 
same remarks as in Subsec. 19.5.2. are applicable: the calculations can be given 
a rigorous foundation within the theory of Sobolev spaces; u(x) is a generalized 
solution. If 2 is a sufficiently smooth domain and if f(x) is sufficiently smooth, 
then u(x) is a classical solution and w|,g=0 holds in the usual sense. The 
numerical practicability of the method depends on whether or not the eigen- 
functions u;() and the eigenvalues 4; can be calculated explicitly or by numeri- 
cal approximation. 
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19.5.4. The Fonrier Method for the Wave Equation 


Consider the initial and boundary value problem as per Def. 19.3.1/2 for 
the homogeneous wave equation with zero boundary conditions: let Q be a 
connected standard domain in Ry, and let Z=2 x (0, ~) (Fig. 19.19). Desired a 
function u(x, ¢) such that 


@2 

a = AU for (x, A€Z, u(y,é)=0 for yee2 and ¢>0, (1) 
Ou 

u(x, 0)=u9(x) and ay OVj=a,(z) for reo. (2) 


Fig. 19.19 


By Theorem 19.3.1/2 the uniqueness is ensured (at least for smooth functions). 
We ask for the existence and the construction of the solution. First of all we 
look for non-trivial solutions v(x, #)= X(x) T(t), such that 


2 


2y 


ea ae for (x,H€Z and X(y)=0 for yea. (3) 


iad) 


Insertion of v= XT gives 
Ae) ee) 
2063) 7 IE) 
where 4 must be a constant. Together with (3) this shows that X(x) is an eigen- 


function in the sense of Subsec. 19.5.3., ie., that X(r)=v,(x) and A=4A,. Then 
T’’(t)=4, T(t). Since 4, <0, one obtains 


[a 
vel, = ol) (age + bye MH), (4) 
where we have written »,(z, ¢) instead of v(x, ¢). a, and b; are arbitrary (complex) 
constants. If one makes the setup 


GG) aes v, (x) (ene ele Maly , (5) 
r=1 


then (1) is formally satisfied (i.e.) without regard to convergence problems). 
If up(x) € £o(Q) and u,(x) €Lo(Q), then up and w, can be expanded in terms of the 
complete orthonormal system {v,(x) }¢-1, 


oo 


U(x) = 2 Cre (2); A es Ay; (x) « (6) 
k=1 C= 
On the other hand (5) (formally) implies that 
co é ——— 
u(x, 0 =A (ay + by) Vx (2), = (x, ae Zi VlAgl (ee — Be) Vela) - (7) 
k= = 4 
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Jf a, and 6; are so chosen that a,+b,=c, and iV|A,| (a,—b,)=d;, then (2) is 
also satisfied. Further one has u(y, ¢)=0 for y€dQ and ¢+0. All the considera- 
tions (problems of convergence, differentiability etc.) can be justified on the 
basis of the theory of Sobolev spaces. We refer to [66]. In this sense w(a, ¢) is 
the (generalized) solution of (1), (2). 


Physical interpretation: For n=2, the problem (1), (2) describes the vibrations 
of a membrane with fixed edges which at rest extends over the domain Q in the 
%4,7-plane. u(x,t) is the deflection from the position of rest at the time ¢ and 
the position x€ Q. According to (5) u(x, ¢) is the superposition of the fundamental 
modes 


+iy 2, |é 
= V il , 


Up (x) x€Q, t€R,. 


1 Val Mi 
ies ue Thus the fundamental tone corresponding to |/,| is superposed 
k 


by overtones which correspond to [Ay], |As|, ... 


2am 
Such a fundamental mode has the period 7,=—— and the frequency 


19.5.5. Vibrating Membrane, Vibrating String 


Vibrating membrane: We refer immediately to the last statements made in 
Subsec. 19.5.4., considering the square membrane with fixed edges, Q=Q= 
= {(a4, X) | O<ay<7, O<x)<7} (Fig. 19.20). In this case the eigenfunctions 
{v;(x)}g_1 are explicitly known: 


lee : i 
iB SIN Nx, SIN MX , Theorem 18.1.4 
TT 


nym=1 
(see also Subsec. 19.5.2.). The eigenvalues are Anm= —n*—m*, Hence the general 
vibration is 
= (An,mell™ +m +b, ne tnt +m) sin na, + sin mx - 
m= ( 
or 


x, Fig. 19.20 0 ce ie, eh 
Vibrating string: We return to the problem of the fixed-ended vibrating string 
as referred to in Theorem 19. 3.2/3 (Fig. 19.21). The remarks made at the end of 
Subsec. 19.5.4. are of course also valid for n=1 and Q=(0, x). By Theorem 


2 
18.1.4, Me sinkx are the eigenfunctions and 4,=—k? are the eigen- 
Te 


values. Hence the general vibration of a fixed-ended string of length z is de- 
scribed by 


Ue) = 2 age pene) sinikae 
Resi 


: h 
The frequencies of the fundamental modes are »;, =5-: 
aT 
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19.5.6. The Fourier Method for the Heat Conduction E quation 


By analogy with Subsec. 19.5.4., we now consider the initial and boundary 
value problem for the homogeneous heat conduction equation with zero bound- 
ary conditions: let 2 be a connected standard domain in #,, and let Z=Q (0, <). 
Desired a function u(x, ¢) such that 


Ou 
am Autor (a ieZ, wey,t)=0 for yeeQ and 1+0, (1) 
ieee) ania tor wae Se . (2) 


Again it can be shown that this problem has at most one solution (at least for 
sufficiently smooth functions). We ask for the existence and the construction 
of this solution. As in Subsec. 19.5.4., one first looks for non-trivial functions 
Cie) — eg (such that 


a 
== Av for (x,t)€Z and X(y)=0 for yedQ. 
The functions must satisfy the condition Ce pe 

X(x) P(t) 
stant. Hence, as in Subsec. 19.5.4., X(x) is an eigenfunction v,;(xr) in the 
sense of Subsec. 19.5.3., with the eigenvalue 4,. For 7(¢) one then has 7'(¢) =a,e* 
Jf we now, by analogy with (19.5.4/5), make the setup 


=A, where /4 is a con- 


ee, UE) > ApUg (2) ofl ; (3) 
k=1 


then (1) is formally satisfied. If u(x) € L.(2), then uo(x) can be expanded in the 
same way as in (19.5.4/6). If we now insert a,=c; into (3), then (2) is also 
satisfied, and we obtain the desired solution. Again it is possible to justify all 
calculations within the theory of Sobolev spaces (we refer to [66]). In this 
sense u(x, ¢) is a generalized solution of (1), (2). 


Physical interpretation: If n=3 and ¢(y)=0, then u(z, ¢) is the solution of the 
“heat conduction” problem of Subsec. 19.1.2. Since 4,<0, one has w(x, t)-0 
as to, as it should also be expected, because the temperature on 62 is always 
zero. Ifn=1and 2=(0, x), then by analogy with Subsec. 19.5.5. we obtain 


Or a 
aia ara 


1) = ae 

k=l 
This can be interpreted as a temperature distribution in a rod of length = which 
is heat-insulated in the lateral directions and whose ends have the temperature 


Zero. 
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20. Operators in Banach Spaces 


20.1. Banach Spaces 
20.1.1. Separable Banach Spaces 


Real and complex Banach spaces have been dealt with in Chap. 6. From now 
on a Banach space shall always be a complex Banach space if not explicitly 
stated otherwise. 


Definition. A Banach space B is called sepurable if there exists a countably infinite 
set {ap ie_,c B that vs dense in B. 

Remark 1. By Def. 17.3.1/2 (and Remark 17.3.1/3), Hilbert spaces are special separable 
Banach spaces. Denseness of a set in a Banach space is defined in Def. 14.6.3. 

Remark 2 (examples). The Banach spaces bl =l.cl and J,[=1,,¢] referred to in Subsec. 
6.2.2., with 1=p<~, are separable: the set described in Subsec. 17.3.2. with respect to /, 
is also dense in /,, with 1=p<e. If Qis a domain in R,, then, by Remark 17.3.2, L,(Q) is 
also a separable Banach space for 1=p<-. If Qis a bounded domain in F,, then it can be 
shown that the countable set of polynomials > ee ie ae with (complex) rational 
coefficients a,, is dense in L,(Q) for 1 = p<. This is rather easily deduced from the consid- 
erations described in Subsec. 18.2.1. 


20.1.2. Special Sets in Banach Spaees 


Definition 1. Let B be a Banach space, and let FE be a nonvord set in B. 

(a) E ws said to be closed if {xe}, CE and x, >x as k> - implies that x€ H. 

(b) # is said to be bounded if sup |la|| <0. 

Cee 

(c) FE ts said to be precompact tf from every sequence {xp} C E tt is possible to 
select a convergent subsequence {Xp} 

(d) EK is said to be compact if EF is precompact and closed. 
Remark 1. We recall that #,—-2 as h>- means that thaw lla, —x||=0. Hence EF is said to 
be closed if the limit of convergent sequences in # also lies in /. 
Theorem 1. Kvery precompact set ina Banach space is bounded. 


Remark 2. In general the converse is not true; scc also Theorem 20.1.4(a). 


Definition 2. Let N and M be any two sets in a Banach space B. If e=+0, then 
N is called an e-net for M if Mc U f{y| xéB, |lr—y||<e}. 

Wea 
Remark 3. #,(x) ={y | y€ B, |je —y||<c} is a ball with centre x and radius «. The definition 
says that such balls cover A/ if their centres belong to N. 


Theorem 2. (a) A set in a Banach space B is precompact if und only tf for every 
e=O there exists a finite e-net. 

(b) A set in a Banach space B is precompact tf and only if for every e+ there 
exists a precompact e-net. 


Remark 4. It would be wrong to suspect that proposition (b) is of no use. 
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20.1.4. 20.1. Banach Spaces Zul 
20.1.3. The Space C(Q) 


af ; . . . = 4 
In Subsecs. 2.5.2. and 6.1.2. we considered the real Banach space Cla, b]. 
In Subsec. 19.2.1. we further introduced C(Q) as an abbreviating notation. 
Now we consider complex-valued functions, but retain the previous notations. 


Definition 1. Zf Q %s a bounded domain in Ry, then C(2) = f(x) | f(x) complex- 
valued and continuous on Q}, with - 


Illes = Sup f(x)! « (1) 
reD 


Remark 1. As f(x) is continuous on the closed set 2, f(x) is also bounded. Hence (1) is 
meaningful. 


Theorem 1. /f ¢x C(Q) the addition of functions is defined by (f+9)(a) = f(x) +9(x); 
and if the multiplication of functions f(x) by complex numbers 4 ts defined by 
(Af)(x) =A4f(x), then C(Q) is a Banach space with respect to the norm (1). 


Remark 2. The case x =1 and 2=(a, b) is largely in accordance with Theorem 2.5.2. 


Definition 2. 7f M vs an arbitrary (not necessarily countable) index set, then a 
family of functions {fu(x)}uear tn C(Q) vs equicontinuous if for every e=O0 there 
exists a number 6=6(e) such that |fx(x)—fuly)| Se for all we M and for all x€Q 
and ye Q with |r—y| 6. 

Remark 3. From Theorem 2.3.2/4 (and from its n-dimensional counterpart) it follows that 
every function /€C(Q) is uniformly continuous on Q. Thus the meaning of the definition 
does not consist in the independence of 6 with respect to x, y, but rather in the independence 
Oli Mig By Th 


Theorem 2 (theorem of Arzela-Ascoli). A family of functions {fu(x)}ucar in C(Q) 
7s precompact if and only if {fu(x) }ucar ts a@ bounded and equicontinuous set in C(Q). 


20.1.4. Finite-Dimensional Banach Spaces 


A complex linear vector spaee B as defined in Subsec. 6.1.1. is called n-dimen- 
sional if there exist m linearly independent vectors a, ...,.7, in B, whereas 
every set of »+1 vectors in B is linearly dependent. Here y,€B, k=1,..., m, 
are called linearly dependent if there exist eomplex numbers 2;, k=1, ..., m, 


™m m 
such that > lAz|>0O, and > Axy,=0. If there are no such numbers A, ..., Am, 
kat Rat 


then y4, oe are called linearly independent. The spaces C,, of Subsec. 6.1.2. and 
5 [=13,c] of Subsec. 6.2.2. are n-dimensional. If x, ..., an have the above mean- 
ing and if r€B, then there exist complex numbers 4, A, ..., a, such that 


n n 
Ax +t > Apap=Oand |A|4+ > |a,|>0. Since’ must be different from zero, one obtains 
k= k= 


i 
= > era, Pp complex, 
k= 


which meaus that .c,, ..., %, form a basis in B. Thus the “co-ordinate spaces” 
Cy, and 1% are already the most general n-dimensional (complex) linear vector 
spaces. In Def. 6.2.2/2 we had defined when two norms on a linear vector space 
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are called equivalent. A proposition on equivalent norms in J has been made 
in Theorem 6.2.2. It turns out that this theorem can be substantially generalized 
as follows. 


Theorem. (a) A Banach space is finite-dimensional if and only if every bounded 
set 1s precompact. 
(b) Adl the norms defined on an n-dimensional linear vector space are equivalent. 


Remark 1. Confer Remark 20.1.2/2. 


Remark 2. A normed finite-dimensional linear vector space is always complete, i.e., it is a 
Banach space. Part (b) then shows that in effect there is only one way to define a norm on C,. 


20.1.5. Completion of Normed Spaces 


In Remark 20.1.4/2 we just stated that normed finite-dimensional linear 
vector spaces are automatically complete. This is not true for general (infinite- 
dimensional) normed spaces. The polynomials P(a) over 2=(—1, 1) forma linear 
vector space that can be normed by || P(2)||z,:@). But this space is not complete, 
as can be deduced, for example, from Theorem 18.2.2. This raises the problem 
of completing a given (incomplete) normed linear vector space to obtain a 
Banach space. 


Definition. /f M, ts a linear vector space with the norm ||x|\,, and if My is a linear 
vector space with the norm ||x\|o, then M,and My are called isometric-isomorphic 7f 
there exists a linear mapping I of M, onto My such that ||L2||,=|la||, for all ee My. 
Remark 1. Linearity of the mapping means that [(Aa+yy)=Al(x) +1 (y) for all eM, 
y¢M,, 4 and » complex. That J maps JZ, onto My means that the range of J is the entire Mb. 
From ||/a\|)=||2||, it follows that the mapping is one to one. In particular the definition is 
symmetrical in J7, and My (one only needs to replace J by the inverse operator J~! in order 
to interchange the roles of AZ, and JZ,). 


Theorem. /f M, is a linear normed vector space, then there exist a Banach space B 
and a linear subset My that is dense in B, such that M, can be mapped onto M, 
in an wometric-isomorphic way (Fig. 20.1). 


fi 
Oy Pe 


Remark 2. Thus in the sense of the theorem it is possible to complete every linear normed 
vector space to a Banach space. (In fact, it cannot be observed from the figure that all 
spaces are linear.) 


20.2. Operators 
20.2.1. Fundamental Concepts 
Definition. Let By and By be Banach spaces, and let D be a linear subspace (vector 


space) of By. 
(a) A mapping A of D into Bg is called linear if, for all x€D and y€D and for 
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a lg 


all complex numbers hand pp, A(Ax+py)=AAr+pAy. Here D= D(A) ts called 
the domain of definition of A. 

(b) A linear mapping A of D into By, with D=D(A)= B,, 7s called continuous 
af tea in Bras ko always implies that Axy—> Ax in By, 

(ce) A linear mapping A of D into By, with D=D(A)= By, 7s called bounded if 
there exists a positive number ¢ such that ||Ax||p, Scellal|z, for all xe By. 

(d) A linear mapping A of D into By, with D=D(A)=B,, is called compact 
af A maps every bounded set in B, intoa precompact set in Bo. 
Remark 1. We write Az instead of A(x) if there is no fear of confusion. In (b), (¢) and (d) 
we assume that the domain of definition is D(A) = B,. It is only operators of this kind that 
shall be discussed in this chapter. For the later investigations, however, the weaker formu- 
lation (a) will be of use. A mapping into B, is one whose range lies in B, but is not necessa- 
rily identical with the entire By. If the range is the entire By, then we speak of a mapping 
onto By. 


Theorem. (a) A mapping A vs bounded if and only if tt 7s continuous. 

(b) A compact mapping A vs continuous. 

(¢) A mapping A is compact if and only if from every bounded sequence {x31 C 
< B, tt is possible to select a subsequence {Xx biz1 Such that {Axp,}72 1, converges. 


Remark 2. All operators (=mappings) considered by us are linear, so that we will not 
indicate this explicitly. 


20.2.2. The Space (8), B2) 


Definition 1. 7/ B, and By are any two Banach spaces, then L(B,, By) is the class 
of all continuous operators that map B,= D(A) into Bg. 
Remark 1. L(B,, Bj) will become a linear vector space if the addition of continuous op- 
erators A, and A, in L(B,, By) and the multiplication by complex numbers 4, and A, are 
defined by 

(A,A4 +},A9)(x) =A, Ayr + AgA ot for all rE B, 5 
It is easily seen that 2,4, +A)A> again belongs to L(B;, Bo). Further 


|A||= sup ||Azllz, (1) 
lz, 1 
is a norm on L(B,, Bs). One has 
|All= sup |[/Az||~z, and ||Al|=infic, 
Itz, =1 


where the infimum is taken over all numbers ¢ with ||A2|| z, S¢|la||n, for all ~€ By. 


Theorem 1. L(B,, By) with the norm (1) is a Banach space. The class of all com- 
pact operators in L(B,, By) forms a closed subspace in L(B,, By). 

Remark 2. A subspace is a linear subset. With the norm (1) the compact operators in L(.B,, By) 
thus also form a Banach space. 

Remark 3 If A,€L(B,, By) and A,EL( By, bs), then AA, EL(B, Bs), where AA, ig 
defined by (424,)(z) =Ao(A,x). One has {|A2A,||S|| All - |All 

Definition 2. Let B, and By, be any two Banach spaces. Further let A_and A be 
two linear operators, D(A)< D(A)c By, which map into By. Then A is called 
extension of A if Ax= Az for all x€ D(A). 
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Theorem 2. Let By and By be any two Banach spaces. Let A be a linear operator 
mapping into By, whose domain of definition D(A) 7s dense in B,. If there exists a 
positive number e¢ such that ||Ax||z, =ellal|z, for all x € D(A), then there exists exactly 
one operator A €L(.B,, By) that is an extension of A. 


Linear functionals: If B, =C, is the complex plane, then one writes L(B,C,) = B’, 
where 6,= 8B is a Banach space. f€ B’ is called a lincar continuous functional. 
Thus /¢ B’ maps the Banach space B into the complex plane. If B,=C,, then 
it is possible to strengthen Theorem 2 substantially. For that purpose we need 
the concept of a continuous linear form. Let B be a Banach space, and let f be a 
linear operator in the sense of Def. 20.2.1(a) that maps D(f)c B into the com- 
plex plane C,. Then f is called a continuous linear form if there exists a positive 
number c such that |f(x)| =ellflz for all x€ D(/). 


Theorem 3 (Hakn-Banach theorem). If f 7s a continuous linear form with D(f)< B, 
then there exists a linear continuous functional {¢ B’ that is an extension of f. 
Here it vs possible to achieve that 
l= sup If(x)l= sup |f(x)l=|I/). 

x]=4 hc = t 

vED SF) teB 
Remark 4. In contrast to Theorem 2 it is not required that D(/) be dense in B. On the other 
hand the uniqueness proposition of Theorem 2 is missing here. The proof of Theorem 3 is 
much more complex than the (simple) proof of Theorem 2. 


20.2.3. Speetrum and Resolvents 


If B, and By, are any two Banach spaces, and if A € L(B,, By), then 

R(A) = {y |.y€ By, there exists an x€ B, such that y=Azx} 
denotes the range of A. If A €L(B,, By) is a one to one mapping of B, onto f(A), 
then A~!, with x=A~!y for y=Ar, is called inverse operator (to A). One has 
D(A~!)= R(A)c By. It is easily seen that A~! is a linear operator. (In any case 
Ai(A) is a linear subspace of By.) If By= B,= B, then we write LB) = 1B B) 
By F we denote the identity operator in B. 
Definition. Jf Bis a Banach space, and if A€L(B), then 

Ma={1|A€O\, there exists (A —AE)-!, and (A —AE)~!€ L(B)} 
us called the resolvent set (of A), and S4=C\M 4 ts called the spectrum (of A). 


Remark 1. In the complex plane C, we thus ask for those complex numbers 4 for which 
A—AEH is a one to one mapping of B onto B, R(A —AE)=B, and for which the inverse 
operator (4 —A#)~! (which then exists) is continuous. The set of all these numbers is 
called resolvent set, the difference is called spectrum. For A€ A/4, (A —AE)~! is called 
resolvent. 


Theorem. Jf Bis a Banach space, and if A€L(B), then 
{A| AEC, |Al>||Al]} My . 


N 4k y= 
For |A\=||All, ce ea is a Cauchy sequence (or (ene mental sequence) in 
al civ © e 
L(B), and the limit of this sequence is (A —AB)~!= ~ 2 aH (Neumann series). 


Remark 2. Here one has 49=#, A2—AA, Ae +! = 444 for k=2,3,.... Hence the spectrum 
of an operator 4 € L(B) lies in a disk of radius |j4|| about the origin. 


LNs) 
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20.2.4. The spacens,)’ 


If 1sp<-, then let 7» [=Ip] be the (complex) Banach space referred to in 
Subsec. 6.2.2. We ask for (/,)’, the s eke of the linear continuous functionals over 
/» in the sense of Subsec. 20.2.2. If f€ (/,)’, then, as previously, Ilan = sup f(x) 

lla, SI 


Theorem. Let 1< p< tes tee =. 
Pp 
(a) Lf y=(Y, Yr, +) Ely’, then (lp) contains every f with 
x)= 3, weve for 2= (ty ty) Ely (1) 
4 


\ 


Furthermore, IMlapy =lle B 


(b) If fE(p)’, then there exists exactly one element y€ly such that K(x) can be 
represented by (1). 


Remark 1. The fact that the element / in (1) belongs to (i,)’ is in effect an Eoplication of 
Hlder’s inequality (6.2.1/1). The theorem i remains true for p=1 if one sets p’ =< and 


L={x | r= (x1, 22. -..), Dele [24] <0}. 


Remark 2. It is eas, to deduce from the above theorem that (2,)’ and L,, are isometric-iso- 
morphic in the sense of Def. 20.1.5. Therefore one also writes (I,)’=1l,,, where it must, 
however, be kept in mind that this is to be understood in the sense of (1). 


Remark 3. If 2 is a domain in #,, then the (complex) Banach space L pif) has the same 
meaning as in Remark 17.3.2 and in Remark 20.1.1/2. If l=<p~<e, Ail if ge L,(Q), then 
(L,(Q))’ contains every . with 


{(h ah h(x) g(x) dz for hEL,(2). (2) 
Furthermore, Peters a2). Conversely, every /€(£,(Q))’ can be represented in the 


form (2) in a unique way, with g¢€LZ,,(2). In this sense one ce Ent) 2). 


20.2.5. Integral Operators 


If 2 is a bounded domain in F,, then we consider the integral operator A, 
(Af)(x aes K(x, y) fly) dy, xre2, (1) 


by which every feces f(x) in 2 is assigned a funetion (Af)(x) in 2 (provided 
that the integral exists). A(x, y) is defined over 2 Q and is called the kernel 
of the integral operator. A is a linear operator (provided that it exists). 


Theorem 1. Jf K(x, y)€C(2X Q), then K is a compuct operator in L(C(Q)). 


Remark 1. The space C(Q) has been considered in Subsee. 20.1.3. Further L(C(Q)) has the 
same meaning as L() in Subsec. 20.2.3., with B=C(Q). 

Remark 2. If A(x, y)€ Ly (QQ), then from Fubini’s theorem in Subsec. 14.4.4. it follows 
that, for a fixed x€Q2, K(x, y) as a function of y belongs to £,(Q) for almost all #€Q. For 
such x€Q, K(x, y) f(y) as a function of y is then integrable, provided that f(y) € £2(Q). It is 
then possible to form (1). The following theorem is to be understood in this sense. 
Theorem 2. If K(x, y)€ Lo(2 xX Q), then K is a compact operator in L1(2)). 
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“lie Operators in Hilbert Spaces 


21.1. Classes of Continuous Operators 
21.1.1. lsomorphy of Hilbert Spaees 


Hilbert spaces have been introduced in Sec. 17.3. All notations shall have the 
meaning indicated there. By Def. 17.3.1/2, in a Hilbert space there exists a 
countably infinite dense subset. Hence Hilbert spaces are special separable 
Banach spaces in the sense of Def. 20.1.1. 


Definition. Two Banach spaces B, and By are isomorphic tf there exists an opera- 
tor ACL(B,, Bo) that has an inverse operator A~1€ L(B,, By). 


Remark 1. What is required is the existence of a continuous operator A, with D(A)=B, 
and k(A)=6,, which provides a one to one mapping of B, onto By, while its (hence ex- 
isting) inverse operator A~! provides a continuous mapping of B, onto B,. If y= Az, then 
it followstfrom||y|lz, $||Alj |lzlle, and ||| 5, s||A7"| llyllg, that there exist two positive numbers 
c, and ¢y such that for all x€ B, 

ellylle, = ||e|_2, Seallyllz, with y=Az. (1) 
The above definition is thus a generalization of the concept of equivalent norms as per Def. 
6.2.2/2 and a generalization of the concept of isometric-isomorphic Banach spaces as per 
Dstt, 240), 11355- 
Remark 2. According to Subsees. 17.3.2. and 20.1.4., C, is an n-dimensional Hilbert space. 
Now it is easy to deduce from Theorem 20.1.4(b) that every finite-dimensional Hilbert 
space is isomorphic to C,, with a suitably chosen m (the dimension of the space). If a Hilbert 
space is called infinite-dimensional if it is not finite-dimensional, then the following state- 
ment can be made. 


Theorem. An cnfinite-dimensional Hilbert space is isomorphic to Io. 


21.1.2. Linear Funetionals 


Theorem (/'. Riesz, Fischer). Let H be a Hilbert space, and let H’ be the space of 
the linear continuous functionals over H. 

(a) Lf ycH, then f(x) =(2x, y) is a linear continuous functional over H, and 
[yl = Ila 

(b) If fe’, then there exists exactly one yCH such that f(x) = (2x, y) for all xe. 


Remark. In the sense of this theorem one has H=H’. If H=l, or H = L,(2), then the 
theorem (largely) agrees with Theorem 20.2.4 and with Remark 20.2.4/3. ~ 


21.1.3. Bilinear Forms 


Definition. 1f 1 ts a Hilbert space, then L(x, y) is called a bounded bilinear form 
af to every ordered pair xe H and ye H acomplex number L(x, y) with the following 
properties is assigned: 
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(a) Lf a4, xo, y are elements of H, and af Ay, Ay are complex numbers, then 
L(Ayry + Agro, y) = AL (ay, y) + AL (xo, y) , 
Ly, Ayeyt Agate) =A L(y, #4) + AyL(y, 29) « 

(b) There exists a positive number ¢ such that, for all x€H and y€H, 
IL (x, y)| Sellatlallylla - (1) 


Remark 1. L(x, y) =(2, y) is a bounded bilinear form. If €L(H), then (Ax, y) and (x, Ay) 
are likewise bounded bilinear forms. 


Remark 2. By analogy with the norm of operators and functionals, we set 


|Zi= sup |L£L(z,y)|= sup |L(z, y)|=infc, 
ail =I] =1 {[2| 4 
yl st 


where the infimum is taken over all numbers c that satisfy (1). 


Theorem. [Ff L(x, y) is a bounded bilinear form in the Hilbert space H, then there 

exist exactly one operator A,€L(H) and exacily one operator Ay€L(H) such that 
Lx, y)=(Ayx, y)=(x, Agy) forall r€H and yeH. 

Further, ||E|| =||A j|=||Aal. 


Remark 3. So the examples stated in Remark 1 are already the most general bounded bilinear 
forms. 


21.1.4. Adjoint Operators 


Definition. Zf H 7s a Hilbert space, and if AC L(H), then A*€L(H) is called the 
adjoint operator of A vf, for all r€H and all yEeH, 
(Ax, y)=(x, A*y) - 


Remark. A ¢€ Z(H) implies that L(x, y) =(Az, y) is a bounded bilinear form. Hence, by Theo- 
rem 21.1.3, there exists exactly one operator A*€Z(//) such that (Az, y)=(x, A*y) for all 
z€Hand all y€H. This justifies the definition. 


Theorem. Let H be a Hilbert spuce. 
(a) If AC L(A), then ||Al|=||A*| and (A*)* =A. 
(b) Jf A,€ L(A) and Ay€ L(A), and tf 44, Ay ure complex numbers, then 
(Aja, +49A)*# =A, AP +A, AF and (A\Ay)*= ASAP. 
(ce) Ff A€L(A), and 7f A-!€L(H) exists, then (A*)-!€L(H) exists too, and 
(A¥)1= (Ane, 


21.1.5. Projection Operators 


Definition. Let H be a Hilbert space, and let H, be a closed subspace of H. [f x= 
=x, +a} ts the representation of x in the sense of Theorem 17.3.5, then the operator 
Puitth Px=x, ts called projection operator (or projector). P projects H onto Hy. 
Remark 1. It is easily scen that the correspondence x ->2,, that means P, is linear, and that 
D(P)=H and ||Px||=||z. Hence PEL(H). If x€f,, then Px=z. Hence it follows that 
|| P||=1, provided that //, is at least one-dimensional. 
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Theorem. /f H is a Hilbert space, then P€L(H) ts a projection operator uf and 
Og Pe 


Remark 2. The corresponding projection space is 11, ={a | x€H, Pips ay” 


21.1.6. Isometric and Unitary Operators 


Definition. Let H, and Hy be any two Hilbert spaces. 
(a) Zf V as a linear operator in the sense of Def. 20.2.1, with DV) cH, and 

RV) CH», then V is called isometric if || V rll, =l\eclla, for all x€ D(V). 

(b) An tsometric operator in the sense of (a) vs called un temp tf DV \ Fe and 
HV Ves. | 
Remark. A unitary operator belongs to L(1/,, 1»). If for any two Hilbert spaces H, and 
fy there exists a unitary operator, then these spaces are isometric-isomorphic in the sense 
of Def. 20.1.5 (and hence also isomorphic in the sense of Def. Ailell a). 


Theorem. [f H, and Hy are any two Hilbert spaces, and if V ts an isometric 
operator am the above sense, then 


(Ve, Vy)m=(% yn, forall x€D(V) and yeD{V). 


The inverse operator V~! is an isometric operator which maps D(V~-1\= RV) in 
Hy onto R(V~!)=D(V) in Hy. If V ts unitary, then V-! is unitary, too. 


21.1.7. Compaet and Degenerate Operators 


The definition of a compact operator has been given in Def. 20.2.1(d). 
Definition. [f H is a Hilbert space, then A € L(H) is called degenerate tf the dimen- 
ston of ats range R(A) is finite. 

Remark 1. Tt is casy to see that every degenerate operator is also compact. 


Lemma. /f H is a Hilbert space, and if AC L(H) ws degenerate, then A can be 
represented in the form 


m 
Az= 7 Gye) e forall eee . 
4 


flere {ex}e-1 ts an orthonormal system and m is the dimension of the (finite-dimen- 
Sk 
sional linear) space R(A). Further, A* is also degenerate, the dimension of R(A*) 
ts im, and 
m 
Ate > (6:6; forall ane . 
kai 


Remark 2. The dimension of a linear vector space was explaincd in Subsec. 20.1.4. 


Theorem. Let H be a Hilbert space. 
(a) AC L(A) ts compact if and only if there exists a sequence {Apiii1 ec L(A) of 
degenerate operators such that lim |Az,— Al] =0. 


(b) AC L(A) ts compaet if and only if A* is compact. 


‘ 
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21.2. The Theory of Riesz and Schauder 


21.2.1. Formulation of the Problem 


mm 
The solution theory of algebraic equations, try, weitle— 1, 2, ie 
k= 
is well known. Here (7y)/-; is a given matrix. The properties of this matrix as 
well as of the adjoint inatrix (rank, subdeterminants ete.) detemnine the theory 
of these equations. We look for the infinite-dimensional analogue to this theory. 
If H is a Hilbert space, and if 4 ¢L(H), then one may ask for which elements 
y¢H the equation 4r=y has a solution, and what can be said about the set of 
all solutions. Here it is expedient to consider the family of equations (A —AF)x = 
=Ar—Ar=y vather than Ar=y alone. Here FE is the identity operator in H, 
and / isan arbitrary complex number. If 7€ M4, then, by Def. 20.2.3, (A —AE)x= 
=y has a unique solution r=(A —AE)~'y for every y€H. Hence a solution 
theory for the equation (A —AF)x=y is closely related to the determination of 
the resolvent set M4 and of the spectrum S4 as per Def. 20.2.3. 
Detinition. Let H be a Hilbert space, and let AC L(H). 
(a) R(A)={y | y¢H, there exists an xCH such that Ar=y} is called the range 
of A, and N(A)={y| Ay=0} ts called null space. 
(b) AEC, vs called eigenvalue of A if there exists an element «+0 such that 
Ax=dr. The dimension of N(A —2E) ts called multiplicity of the eigenvalue A. 


Remark 1. It is clear that R(A) and (A) are linear subspaces of 7. If N(A —AE) is not 
finite-dimensional, then / is said to be an eigenvalue of infinite multiplieity. Further it is 
easy to see that in any case V(d —AL) is a closed subspaee. If / is an eigenvalue, then 
AES 4. 


Remark 2. If R(A—-AL)=H, then Ax—/x=y has at least one solution for every yéH. 
If NV (A —/AE) ={0} (the set consisting of the zero element alone), then for a given y€// the 
equation Ax —/x=y has at most one solution. 


21.2.2. Decomposition Theorems 


If H, is a (linear) subspace of the Hilbert spaee H, then 
Hy= fc | «¢ H, there exists a sequence {xz}. A, such that 1-2} 
is the closure of H,. It is easy to see that H,, is a closed (linear) subspace of H. 
Theorem. Let H bea Hilbert space, let AC L(H), and let 4 be a complexe number. 
(a) H = R(A —AE)@ N(A* —1E) = R(A*—AE)@N(A-AE). 
(b) If A is compact and if A+0, then 
H = R(A—AF)@ N(A* -(E) = R(A*-AE)@N(A-AE). (1) 


Remark. @ has the meaning indicated in Subsec. (7.3.5. The essential proposition of the 
theorem is contained in (1): R(A—Z#) as well as h(A*—J/E) are closed subspaces if A is 
eompact and A4+0. Aeeording to (1), Ax—Ja=y has a solution if and only if (y, z)=0 for 
alllzeN(A*—/#), that means if A*2—72. 
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21.2.3. The Spectrum of Compact Operators 


We had already stated that an eigenvalue 4 of an operator A ¢ L(H) belongs 
to the spectruin S4 of the operator A. In contrast to the finite-dimensional case, 
it ay however happen that there are complex numbers 2¢€S4 which are not 
eigenvalues. In this case the inverse operator (A —AE)~! exists, but it does not 
belong to L(H). The following (rather far-reaching) theorem now shows that 
compact operators (unhke any other operator in L(#)) largely behave as one is 
accustomed to from the finite-dimensional case. 


Theorem 1. Let H be an tnfinite-dimensional Hilbert space. The spectrum S4 
of the compact operator A€ L(H) consists of the point A=0 and of at most countably 
many eigenvalues that can accumulate only at the origin. Every nonzero eigenvalue 
as of finite multiplicity. Further, S4x={A| ACO, AE S84}. 

Remark 1. This theorem together with Theorem 21.2.2 enable a remarkable conclusion to 
be drawn: let d¢L(H) be compact, and let 4+0. Then the equation Ax —Ar=y has a 
unique solution for every y¢€H if and only if A is not an eigenvalue; further one has 
N(A* —/£) = {0} if and only if N(A —AB) ={0}. The last proposition can even be strengthen- 
ed as follows. 


Theorem 2. If H is a Hilbert space, and if A+=0 ts an eigenvalue of a compact 
operator A¢L(H), then N(A —AE) and N(A*—AE) are of equal (finite) dimension. 


Remark 2. The theorems stated in this section are the abstract formulation of the so-called 
Fredholm alternatives for integral equations we shall discuss in the following section. 


21.3. | Fredholm’s Integral Equations 
21.3.1. The Adjoint Integral Operator 


If 2 is a bounded domain in R,, then the operator K, 
(Af)(x) =A K(x, y) fly) dy , 


is the mtegral operator of Subsec. 20.2.5., where we now assuine that the kernel 
K(x, y)€L,(QX 2). Then, by Theorem 20.2.5/2, K€ L(L,(Q)) is a compact 
operator. Hence it is possible to apply the theory of Riesz and Schauder as 
described in Sec. 21.2. The first problem is to determine the adjoint operator 
K*€L(L,(Q)). 


Theorem. IW7th the above assumtions, one has 
(A*/)(x) ei Ky, x) fly) dy for fel (Q). 


Remark. The equations of Sec. 21.2. now take the form 
S K(x, y) Hy) dy —Af(x) =h(z) . (1) 
2 


Here h(a) €L,(2) and K(a, y)€ £.(QXQ) are given, and A+0 isa complex number. Desired 
is a function /(#) € Lo(Q) such that (1) is satisfied. Equations of this kind are called Fredholm 
integral equations of the second kind. The theory of Sec. 21.2. and the above theorem show 
that, in addition to (1), the adjoint equation 


J Ky, &) gly) dy —Ag(x) =h(x) (2) 


is of interest too. 
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21.3.2. Fredholm’s Alternatives 


We now transfer the abstract theory of See. 21.2. to the operators and equa- 


tions of Subsec. 21.3.1. All notations shall have the same meaning as in Subsec. 
al a al . 


Theorem 1. The spectrum Sx of the compuct integral operator K€ L(L(2)) of 
Subsec. 21.3.1. consists of the point A=0 and of an at most countably infinite number 
of nonzero eigenvalues of finite multiplicities which can accumulate only at the 
potntiA=0. 


Remark 1. The theorem is an implication of Theorem 21.2.3/1 and Subsee. 21.3.1. 


Theorem 2. 7f 2+0 ts not an eigenvalue of the integral operator K, then the Fred- 
holm integral equations of second kind (21.3.1/1) and (21.3.1/2) have a unique 
solution f(x) € L(Q) and g(x) € Lo(Q) for every function h(x) € Lo(Q). 


Remark 2. The theorem is an implication of Theorem 21.2.2(b) and Remark 21.2.3/1. 


Theorem 3. 7f 4+0, then the following alternative is true: either the Fredholm 
tntegral equation of second kind (21.3.1/1) has a unique solution in L(Q) for 
every function h(x) € L(Q) or the homogeneous integral equation 


Py K(x, y) f(y) dy —A4f(x) =0 i. 


has a non-trivial solution in Lo{Q). 
Remark 3. This theorem has also been transferred from Remark 21.2.3/1. 


Theorem 4. Jf 4+ 0, then the homogeneous integral equations (1) and 
J KY, 2) gly) dy—4g(x)=0 (2) 


have an equal (finite) number N of linearly independent solutions in L(Q). Let 
f(x), +) f(x) be linearly independent solutions of (1), and let g,(x), ..., gx(x) be 
linearly independent solutions of (2). Then (21.3.1/1), with h(r)€L.(Q), has a 
solution in Lo(Q) if and only vf 


f R(x) g(x) dr=0 for k=1,...,N. 
2 


Accordingly, (21.3.1/2) with h(x) € Lo(Q) has a solution in Lo(Q) if and only tf 
TiGidr—=O0 jor kw=1,.....N. 
2 


Remark 4. The theorem is an implication of Theorem 21.2.2 and Theorem 21.2.3/2. The 
theorem is of interest only for those 4=-0 which are cigenvalues of K. For other numbers A 
the theorem is also true, but it does not give any new information as compared with the 
previous theorems. 


Remark 5. The theorems of this section are ealled Fredholm’s alternatives for integral 
equations. 
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oo. Distributions 


22.1. Fundamental Concepts 
22.1.1.- Introduetion 


A continuous function in R, does not necessarily have continuous partial 
derivatives of first order, a continuously differentiable function does not neces- 
sarily have continuous partial derivatives of second order, and so on. These 
statements are elementary and can be easily supported by examples. But from 
the physical point of view these statements are regrettable. On the one hand ma- 
ny physical theories are based on partial differential equations, but on the other 
hand the mathematical idealizations of physical problems in a natural way 
lead to functions that are not differentiable or even discontinuous. This can 
be demonstrated by many examples in electrodynamics and quantum mechan- 
ics. There are two possibilities to escape this dilemma. One may either refor- 
mulate the fundamental laws of corresponding physical theories or look for a 
new basis, an extended concept of function, a new concept of differentiability 
ete. Both ways are practicable. In the first case one may, for example, replace 
partial differential equations by integral identities that are also valid for more 
general functions. But here one frequently encounters new difficulties, not to 
mention the fact that inathematicians and physicists prefer to calculate with 
partial differential equations rather than with integral identities. The other 
way leads to the theory of distributions (generalized functions): the concepts 
of a function, of differentiability etc. are extended, while the partial differential 
equations are retained. Many technical difficulties and (from the physical 
point of view) artificial constraints on the partial differential equations of 
theoretical physics are avoided by this new calculus. Anyway, the physicists 
hadn’t bothered so much about such “mathematical subtleties” (and very 
rightly so, as we know today). After all, it seems to be a sort of quality charac- 
teristic of good theoretical physicists that even on unexplored mathematical 
terrain they instinctively avoid or jump over the pitfalls (including the well 
masked ones) that are abundant there. Less competent theoretical physicists 
collect in these pitfalls, but will deny this vehemently. Here it is recommendable 
to wait until mathematics will reclaim these areas with heavy clearing tools and 
mark “popular” hiking trails. A mathematician may either loftily turn up his 
nose, or observe with astonishment and admiration (though with slight dis- 
comfort and a trace of jealousy) that results which are equally striking and 
spectacular from both the physical and the mathematical point of view come 
out at the end of a way which, for the present, lacks full inathematical support. 
In this chapter we shall develop the fundamentals of the theory of distributions, 
while applications of this theory to partial differential equations will be de- 
scribed in the following Chap. 23. 


» 
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22.1.2. The Spaces D(Q) and D’(Q) 


If 2 is a domain in R,, then we denote by D(®) the class of all complex 
functions with compact support that are differentiable on Q up to arbitrary 
order (another common notation is Cy(Q)). The definition of a function with 


compact support has been given in Def. 14.6.4/2. Theorem 14.6.4/2 has shown 
that D(Q) is dense in L,(@), with 1= p<, 


Definition 1. A sequence of functions {or(x)}g-1 C D(Q) converges‘ in D(2) to 
g(x)ED(Q) if all the supports of the functions @x(x) are included in a compact 
subset of Q and if D*q;(x) converges uniformly to D*g(x) for every multiple index a. 
Remark 1. For the convergence in D(Q) we also write Pk nay? Thus, what is required is that 
there exists a compact set w, with wcQ (depending on the sequence {g;}) such that 
SUPP 9; Cw (Fig. 22.1). The concept of a support, supp, has been explained in Def. 14.6.4/2 
and in Remark 14.6.4/2. Since w is closed and bounded and Qis open, w hasa positive distance 
from the boundary 02 of Q. Since g(a) is the limit approached by uniform convergence of 
pz(x), one also has supp g Cw (to which there corresponds the multiple index «=(0, ..., 0)) 
The notation D*p for a multiple index « was introduced in Remark 8.1.2. 


Definition 2. A distribution is a complex continuous linear form T over DON: 
That means, every ¢ €D(Q) ts assigned a complex number T(p) with the following 
two properties: 
(1) For pe D(Q), pe D(2), and the complex numbers 24 and p, 
T(Ap + wy) =AT(p) + uT (yp) (linearity). 


(2) Coney 7 implies that Tp) > T(¢) (continuity). 


The class of all distributions over D(Q) is denoted by D’(Q). Here T= T> means 
that T(¢)=To(p) for all p€ D(Q). 

Remark 2. This is the fundamental concept in the theory of distributions, the elements of 
which were developed by S. L. Sobolev in 1936, while L. Schwartz completed the theory 


systematically in the late forties and fifties. Since that time distributions play a fundamen- 
tal role in modern analysis as well as in some parts of theoretical physics. 


22.1.3. Examples of Distributions 


If Q is a domain in R,, then /°°(Q) denotes the class of all locally integrable 
complex functions on 2. This means: if o is a compact subset oe and if 
4.(x) is the associated characteristic function, then f(x) belongs to iy (Q) if and 
only if yo(x) f(x)€L,(2) for every set w of this kind. For example, one has 


4 ¢ pee(gy, where 2=(0, 1), but 61 (0). The fact that f(x) belongs to L/P°(Q) 
t 


does not include any information about the boundary behaviour. As for 
the spaces L,(2), we make the convention that two functions f(x) and g(x) 
are considered to be identical elements of L/°°(Q) if they differ from one 
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another only on a set of the Lebesgue measure zero. For this point, refer to 
Subsec. 14.6.1. 


Lemma. Jf f(x)€ ae and tf 
Boies odr—O0" forall SogieD(2\. (1) 


then f(x) =0 for almost all x€ Q (2.e., the zero element in L'0°(Q)). 


Remark. Since g(2) has a compact support, the integral in (1) is meaningful. The lemma is a 
generalization of the fundamental lemma of the calculus of variations as given in Subsec. 
Hill 2. 


Regular distributions (or distributions of the function type): If Q is an arbitrary 
domain in R,, and if ns € L'?P*(Q), then it is easy to see that 


Ty(g)= f f(x) plz) dr for px)eD(Q) (2) 
fulfils the conditions of Def. 22.1.2/2. Hence 7;¢€D’(Q). On the other hand, if 
ge LP(Q), and if T/(~)=T,(~) for all p€ D(Q), then it follows from the above 
lemma that f=g in L°(Q), Thus, according to the above convention, the 
correspondence between /€ L‘°°(Q) and 7;¢D’(Q) is one to one. In this sense 
we identify L/P(Q) with the corresponding distributions (Fig. 22.2). Instead 


of Te D’(2) we then write for abbreviation f(r) ¢.D’(Q). Distributions of this 
kind are called regular. 


(ky 


§-Distribution (or Dirae’s distribution): A distribution which is not regular in 
the above sense is called singular. If a¢€Q, then 6,, 


daly) =y(a) forall pEeD(Q), 

is a singular distribution. If 0€ 2, then one also writes 6 instead of 6). 

Other examples: If a€ Q, and if f(x oe then it is easily seen that 7, with 
T(p)=D*y(a) and T(p =i f(x (aida for ee Dio 


Fig: 22.2 


are distributions in D’(Q) for every multiple index «. 


22.1.4. Operations with Distributions 
Definition 1. Let 2 be a domain in Ry. 
(a) (Differentiation). For TED (Q) and a multiple index a, 
(D*P)(p) =(—1)"! P(D*y) for pe D(Q). 


(b) (Multiplication). For a complex function a(x) that is differentiable on Q up to 
arbitrary order, and for T€D’(Q), 


(OP)(p) = (a(x) T)(~) = Tal) p(x) for pe D(Q). 


ox 
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(e) (Addition). For T,€.D'(Q), Ty €D’(Q), and for the complex numbers 4, and dy, 
(AyD y+ AT )(p)=Ay Tyg) + A2T lp) for peD(Q). 
Theorem 1. If all quantities have the meaning stated in the above definition, then 
DAT ep), aTeED(Q) and AyPy+a_T.€D(Q). 


Further (in the sense of the above definition) the following relation holds 


tyY 


yy ou Gh 4 . 
(a(x) ers ] se Bx; for T= i sees Ml (1) 


LF 
and D**®T = D*( DPT) = D?(D*T), Here x =(a, --+,%) and B=(Bj, 5 Bn) are mul- 
tuple indices, and a+B=(a«,+ 64; -- &n+Bn)- 

Remark 1. Hence it is possible to differentiate distributions up to arbitrary order, and to 
multiply them by smooth functions. If PEL P(Q), and if 7,;¢D’(Q) is the corresponding 
distribution, then 7,;=a7;. where a(z) is digieneneinye up to arbitrary order on Q. If 


f(x) € ploe( 2) has seen pout al derivatives up to the order |«], which shall be denoted by 
D*}(x), then D*T;,= =P ps, tay" . In the sense of the identification of LIQ) with the regular 


distributions, the previous concepts (multiplication by functions, differentiation) thus agree 
with the newly introduced ones. 


Remark 2. It follows from (1) that for D*(a(z) 7’) the usual Leibniz formula for the derivative 
of a product is applicable. 


Remark 3 (examples). If 2 is a domain in FR, and if a€Q, then 
(D*0q)(p) =(—1)!"|D%p(a) for pe D(Q). 

Moreover, if p(z) is a function differentiable on 2 up to arbitrary order, then 
(p(x) 4a)(~) = pla) (a) - 

We consider another example. Let Q= R,, and let 


y(t) ={i for O<t<o 


be the Heaviside function . 
0 for —o<t=0 


If y(t) €D’(£,) is regarded as a distribution over A,, then y’=6 (where y’ is the distribution 
derivative). 

Lemma (partition of unity). Let B be a bounded dos setin Ry, and let {O,}p_, be a 
system of bounded open sets in Ry, such rae cu Oz (Fig. 22.3). Then there exist 
real-valued functions py, €.D(Ox) such that 2 mele) )=1 for x€ B. (Here the functions 


g(x) are continued to vanish outside of O;.) 


* x 
oo is Oo ‘ 
* ! 
, eal 
‘ 4 
2 “4 
’ , 
i 0; , 
: ie 
s Foe . 
eee Fig. 22.3 


Remark 4. Thus the function f(z) =1 for «¢€ B can be decomposed into smooth partial fune- 
tions, each of which is different from zero only in a small domain. For the above functions 
it is also possible to achieve that 0=;(xz) =1. The meaning of this important lemma is that 
it is possible to reduce global properties (of functions and distributions) to local oncs. This 
is exemplified by the following theorem. 
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Theorem 2. Let Q be a domain in Ry, and let {Ox}e_, be a system of nonvoid bounded 

domains in Ry, such that U O,=Q. If the restrictions of two distributions T ¢ D’(Q) 
b= 

and T,€D'(Q) coincide on Ox, k=1, 2, 3, ..., then as, 


Remark 5. If 7 ¢D’(Q), then also T €D(O;), because the restriction T(p) to p€D(O;z) yields 
a distribution over O,. In this sense, the restriction of a distribution over Q to a subdomain 
of £2 again yields a distribution. The theorem then says that (the global property) 7',= 
=T,¢€D(Q) holds if and only if (the local property) 7,=7,€D‘(O,) for k=1, 2,... The 
theorem is proved by means of the above lemma. 


Definition 2. Let Q bea domain in Ry, and let TED (2). Then supp T (the support 
of T) is the set of all points x€Q for which the restriction of T to QQ) fy | lr—y| <6} 
is different from the zero distribution for ever y positive number 6. 


Remark 6. Hence the method of checking x€supp 7’ is as follows. One asks whether for 
every positive number 6 there exists a function PoE D2 {y | |z—y| <6}) such that T(p6) + 
+0. If this is the case, then « belongs to supp 7’, otherwise not. supp 7’ may also contain 
points «€92 that are not in Q. It is easily seen that supp 7' isa closed set in R,. The restric- 
tion of 7’ to Q\(QM supp 7) is the zero distribution. This follows from Theorem 2. Moreover 
it is seen that Q\(QN supp 7’) is the largest domain with this property. 


Remark 7. If f(x) € L'0°(Q), then we can form supp f/ by Def. 14.6.4/2 and supp f=supp 7’; by 
Def. 2. One has supp 7';Csupp f, but it may happen that supp / with f as a function includes 
supp 7';=supp f, with f as a distribution, as a proper subset. In the first case every point x 
with /(z) +0 is considered, whereas in the second case points of a set of measure zcro do not 
play any role. For continuous functions the two concepts coincide. 


Remark 8. If Q is a domain in R,, and if a€Q, then supp D%6,= {a} for every multiple index 
a. If y(t) is the Heaviside function referred to in Remark 3, then supp y=[0, ~). 


22.1.5. The Space E’(Q) 


Definition. Tf Q 7s a domain in Ry, then E’(Q) is the class of all distributions tn D’(Q) 
whose supports are bounded and lie in Q. 


Remark 1. As supp 7c for arbitrary 7'€.D’(Q), the above requirement that supp 7’'cQ 
means an essential restriction. Since, moreover, supp 7 is bounded and closed for TEE (Q), 
in this case supp 7' has a positive distance from the boundary 02 (Fig. 22.4). This is also the 
essential property required for the proof of the following theorem. 


Cy Supp T 


Q 
82 Fig. 22.4 


Theorem 1. 7 Q ds a domain in fin, and if T¢eE'(Q), then there exist a natural nume- 
ber N and a positive number C such that, for all pED(Q), 
IT(p)l=Csup > ID*p(xr)| . (1) 
tEQ [aj SN 


Remark 2. Conversely, if an arbitrary linear form 7 over D(Q) can be estimated in the form 
(1), then it 1s immediately obvious that TED (Q) 
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Theorem 2. If Qs adomainin Ry, and ifae Qand supp 7'= {a}, then there exist a 
natural number N and complex numbers uz such that 


(ie Wiles ee (2) 
je|3." 


Remark 3. If 7 lias the form (2), then 7 is either the zero distribution, or supp 7'= {a}. In 
other words, with the exception of the zero distribution, supp 7’= {a} if and only if 7’ has the 
form (2). 


22.2. The Fourier Transform and the Spaces S(R,,) and S’(R,) 


22.2.1. The Space S(R,,) and the Fourier Transform 


Definition 1. The space S( Rn) consists of all complex functions g(x) that are differ- 
entiable on R, up to arbitrary order and have the property that 


elle = sup (1+ |xl*) > |D*p(2)| << (1) 
zrERy, ja] sl 


(oe Nee, wom 1, 2, 2. 27 (OJ) CA Rhy) and pES(Ry), then we write 


Tia? (convergence in S(R,)) if, for all k and 1, |lgm— lez >0 as mo. 
i n 


Remark 1. |g||,,7 is a norm. Thus S(R,) is characterized by a countably infinite number of 
norms. Spaces of this kind are also called locally convex. S(R,) is called the space of rapidly 
decreasing functions. The meaning of the designation will be clear from a look at (1). One 
has D(R,) CS(R,). A function that belongs to 8(R,) but not to D(R,) is e- 2”. 


Detinition 2. Jf péS(Ry), then Fe, with 
. n 


(Fox) =(2n) ® ferKehe(é) dé, xe Rn, (2) 
Ry 


is the Fourier transform of ¢. 
Remark 2. If Bala .., €,) and #=(2), ..., ,) then we now set (in order to avoid confusion) 


n 
(x, &)= D x,&,. Since p€S(R,), it is easily seen that (2) exists for every x+€ Ry. 
k=1 


22.2.2. Properties of the Fourier Transform 


. . a 
Aompre vi0ue Waewemwnilonr — tion, wiere V=(0\..,0,)6R, and a= 
(2), 2.4%) Isa multiple index, 


Lemma lL. (a) Jf p€S(R,), then also Fp S( Ry), and 
D*(Fe) =(—i)"! F(a*p) as wellas &* Fo =(—i)*! F(D%q). ad) 


(b) Lf {pehF1 Cc S(Rn) and tf py —> ¢, then Fo, —> Fo. 
S(Ry) SCUty) 
Remark 1. According to (1) the Fourier transform interchanges derivatives D* with inulti- 
plications x*. This already suggests that it is possible to reduce partial differential equatious 
to algebraic equations by means of Fourier transforms. 
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eee 


Lemma 2.(a) fie *)j=e 7, 
(b) If pES(R,) and e+0, then 


F(p(e&))(x) =e-"F (p(E))(we7!) . 
_ 12? 


Remark 2. Consequently, e 2 is an eigenfunction of F. 


Theorem. The Fourter transform F provides a one to one mapping of S( Rp) onto 
S(R,). The inverse mapping F—! is called inverse Fourier transform, being given by 
Nn 


(F!@)(x) = (2x) ® f ele (E) dé, peS(Rn) . (2) 


Remark 8. A mapping “onto” means that the range of F is the total S(R,), ie., F(S(Rp)) = 
=S(R,). The formula (2) differs from (22.2.1/2) only by a sign in the exponent. For y€S(Rz). 
(P19) = FW Fp) = 


22.2.3. The Space S’(2,,) 


Definition. A tempered (or slowly increasing) distribution is a complex continuous 
linear form T over S(Ry). This means that every ¢€S(Rp) is assigned a complex 
number T(p) with the following two properties: 

(1) For ~ES(Rn), pe S(Rp) and the complex numbers 4 and p, 


T (Ag+ py) =AaT(p)+ uly) — (linearity) . 
(2) go, ——> sR)” implies that T(¢x) > T(p) (continuity). 
The class of all tempered distributions is denoted by S’(Rn). Here T= T. means 
that T \(~)=To(¢) for all pe S(R,). 


Remark 1. This is the analogue to Def. 22.1.2/2. go, aa y implies that mp, RD gy. Hence it 
follows that the restriction of 7 €S’(R,) to D(R,) belongs to D’(R,). In this sense one thus 
has S’(R,)c D(£,). On the other hand, if 7¢#’(R,) and if pe DR n) ls afunction such that 
y(“)=1 in a neighbourhood of supp 7, then 7g) = T(gy) is meaningful for yp €S(R,). It can 
be verified dna T(¢) is independent of y and that T'€S’(R,). Hence 

EY R,) CS Ry) CD Ry) 


in the sense of the above interpretations. 


Theorem. Zf T'¢S’( Ry), then there exist natural numbers k and Las well as a positive 
number C, such that for all pe S( Ry) 


IT(@lsC |lelles - (1) 


Remark 2. This is the analogue to Theorem 22.1.5/1. Conversely, if an arbitrary linear form 
T over S(R,) can be estimated in the form (1), then it is immediately seen that T'€S’( R,) is 
a tempered distribution. 


Remark 3 ee as with 
=e a)dz, pES(R,), (2) 


Fol.) oe belongs to S’(R,). Here L,(R,) has the same meaning as in Subsec:- 
14.6.4., where /(z)€L,(R,) can now also be complex-valued (cf. Remark 20.1.1/2 and Re- 


OEE 


vO 
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mark 17.3.2). By analogy with the interpretation that a n) CD'(R,), as stated in Sub- 
sec. 22.1.3., we now have L pi kn) CS(R,) for spss. i in formula (2) one chooses poly- 
nomials in #,, then one lel ise obtains tempered distributions. Moreover it is observed that 
the increase of |/(a)| as |a| - « must not be too rapid if T, is to belong to S’( R,) (this accounts 
for the terminology in the above definition). 


22.2.4. The Fourier Transform in S’(2,) 


Definition. 7f TES’( Ry), then FT defined by 
(FT)(p)=T Ee) for peS(Rn) 

vs called the Fourier transform of T, and F-'!T defined by 
(FAP\(¢)=T(F-'9) for peS(Rn) 

vs called the inverse Fourier transform of T. 

Remark 1. (F7)(y) and (F~!7)() are linear. If |T(p)| Se |lp||,.z, then it follows that 


(FT)(p)| =|2(F9)| Se (Folens loltns ye: . 
Hence FT €S’(R,), and analogously F-!7'€S’(#,). Thus one again obtains tempered distri- 
butions. 


Remark 2. If pc S(f,), then we have two definitions for Fy, namely Def. 22.2.1/2 and the 
above definition, provided that y= 7, is interpreted as an element of S’( R,). It turns out that 
both definitions lead to the same result. In other words, the above definition extends the 
Fourier transform and the inverse Fonrier transform from S(R,) to S’(R,). 


Theorem. The Fourier transform F provides a one to one mapping of S’(R,) onto 
S’(R,). Further, 


aie orp dees Cli). 
Remark 3. This is the extension of Theorem 22.2.2 to S’(f#,). In particular F—! also provides 
a one to one mapping of S’(R,) onto S’(R,). 7 


Remark 4. If f(z) ¢€£,(£,,), then by Remark 22.2.3/3 it is possible to regard f(x) as a regular 
distribution in.S’(R,). Then Ff is also a regular distribution in S’(f,), and 


(F(z)=2r) 2 feHeHy(g) dé. 


iB 
Remark 5. The following example is of interest for the three-dimensional wave equation. Let 
Valo)= JS plx)ds for peS(ks3) and #>=0. 
ae 


One has V p€ ER), and hence also Vp€S’(R;). The support of the (singular) distribution 
Vp is the sphere {x | |z| =}, R>0. The Fourier transform F Vp» is a regular distribution, 


and 
/ sin f | a 
FVp =k 


lacy 


| 
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22.2.5. Other Properties of Fourier Transforms 


Theorem 1. (a) Jf 7 ts « distribution with supp T = {0}, then FT ts regular and 
equals a polynomial. 
(b) Tf TES’ Ry), then for every multiple index 
F(D*T)=i"le"FT and F(atT)=ilD*(FT) . 
Remark 1. By Theorem 22.1.5/2, part (a) of the above theorem follows from 


n 
F( > a,D%))= Dd (Qn) 2iltla,v. 
als V i@loN 
ae 
Hence in particular one has F(6)=(27) 2. 
Remark 2. Part (b) of the theorem is a generalization of Lemma 22.2.2/1. lf 7¢€S(R,), then 
xT and D*T also belong to S’(R,). 


Theorem 2. The Fourier transform F and the inverse Fourter transform F-! are 
unitary operators that map the Hilbert space Ly( Ry) onto itself. 

Remark 3. This theorem is of fundamental interest. The concept of the unitary operator has 
been described in Def. 21.1.6. In the notation used there, one has V=F (or V=F7-'!) and 
DV) =R(V) =L,(F,). Thus, what is meant here are the restrictions of F and F~! (which are 
originally defined on the total S’(R,)) to Lo(R,). 


HD ac Tensor Products and Convolutions 
22.3.1. Tensor Produets 


TB eg acy) < Fig — (Yigg iy) ee en) — ae 
Ym)€ Rn+m- We consider distributions 7'¢€D’(R,) and S¢€D’(R,»). To characterize 
the dependence on the variables, we sometimes also write 7’, instead of 7 and Sy 


instead of S. 
Lemma. If Sy€D/(Rm), and if pl, y)ED(Ru+m)s then la) =Sy( ole, y)) €D(Rn), 


and 
D*y(x)=S8,(Dig(ax, y)) for every multiple index a. 


galaxy y) Daa p(x, y) implies that 
m 
S, (g(x, y)) = vl) or, PI) =Si (g(x, y)) - 
ae y) 


a 


Oe 


tiation is carried out only with ‘rope! tO 2,...,%,, but not with respect to y, ..., ym. AN 


Remark 1. Here Dig(x, y) = . hence the index x indicates that the differen- 


analogous statement holds for ey, If there is no fear of confusion, we prefer to write D* 
instead of D% or Dy, which then means that the differentiation can be carried out with respect 


to all variables that come into question. 


Theorem. /f TED'( Ry) and SED'(Ry), then there exists exactly one distribution 
UED'(Rnim) such that, for all p(x)ED( Ry) and all p(y)€ED( Ry), 


U(p(x) y(y)) = T(p(a)) S(p(y)) - (1) 


9 a Oe z y 
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For o(x, y)€ D( Rasm), this distribution U can be calculated from 
U(o(x, y)) =8S,(Te(olx, y))) = T(Sy(olx, y))) (2) 


Remark 2. The lemma shows that the terms formed in (2) are meaningful. The linearity of U 
is obvious, and so is property (1). What remains to be shown is the continuity of U and its 
uniqueness. 


Definition. 7f TE D’(R,z) and SED'(Ry), then UCD’ (Ry 4m), as given in the above 
theorem, is called the tensor product of T and S. We write U=T@S. 


22.3.2. Properties of Tensor Produets 


Theorem. ff TED’ (Ry), SED’ (Rm), and V€D’( Ri), then 
Tr®Sy=SyQTz (commutativity) , al 
(TES)@V=TE(S@V) (associativity) , (2) 
EO ey) — (0 1 On ead (3) 
Upp oe (007) ee eh ,e eecciippl’, yesupp os) s 


Further, the tensor product is continuous: from T;,€ D’( Ry), TED’ (Rn), and T;(~) + 
+T(¢) for all ¢€D(R,) as k-@, tt follows that 


(Tr @S)(0)-(T@S)\(e) forall e€D(Razm) and kro. 


Remark 1. Formula (1) is a reformulation of (22.3.1/2); the associative law as well as (3) 
are also easily obtained from this representation. 


Remark 2. Let f(z) €L/0°(R,), and let g(y)€LIP( Rm). If f and g are interpreted as regular 
distributions, then {/@g is also a regular distribution, and 


(f @g)(x, y) =f(x) g(y) for x€R, and yCRp. 


Thus the tensor product of distributions is a generalization of the point-by-point product 
of functions. 


22.3.3. Convolutions 


Lemma. /f T¢€D’( Ry), SED (R,), and if pe D( Ry), then 
supp [@ (<+y) (T@S8)] 
-E{(z, y) | (x, y)€ Ron, xEsupp T, ycsupp S, r+yesupp 9}. 


Remark 1. supp 7 has been introduced in Def. 22.1.4/2. According to Remark 22.1.4/7 it is 
of no account whether supp @ is understood in the sense of Def. 14.6.4/2 (as a function) or in 
the sense of Def. 22.1.4/2 (as a distribution). 


Definition. Let 7 ¢€D’( Ry) and S€ D’(R,). If for every function gy € D( Ry) the set 
{(x, y) | (x, y)€ Ron cesupp T, ycsupp S, r+ yEsupp 7} a 
ts bounded in Roy, then TS defined by 
(T*S)(p) =[plety)(T@S) ole, y)) for ee DUtn) ; (2) 
is the convolution of T and S. Here o(x, y)ED( Ry), with o(x, y)=1ina neighbour- 


hood of the set (1). 
17* 
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Remark 2. The above lemma shows that 47 =supp [y(z +y)(7'@8)] is included in the set 
(1). Hence I is a closed and bounded set in Ry,,. But then there exist functions 9(%, y) € D( R2,) 
such that o(#, y)=1 in a neighbourhood Q (domain) of JZ, i.e., M CQ. (This is meant by the 
formulation in the above definition.) [t is easily verified that (2) is independent of the choice 
of such functions 9g. 


29.3.4. Properties of Convolutions 
] 


Theorem L. (a) Let the assumptions of Def. 22.3.3 be satisfied. Then TS €D(R,). 
Further SxT exists (and belongs to D’( Ry) too), and TxS=S*T (commutativity). 
Ifa isa multiple index, then the convolutions D*TxS and TxD*S exist too (and be- 
long to D’(R,)), and 


DS) eS) 1S ab) 
(b) If the convolutions TxS, and TS» exist in the sense of Def. 22.3.3, and if 
A, and Ay are complex numbers, then Tx (AyS + AgSq) exists, and 
T (AS y+ AgSo) =A4( T'S 1) + Ao( T'S) « 
Remark t. One has supp D*7’Csupp 7. Hence it is clear that all distributions in (1) exist 


Theorem 2. (a) /f TED’(R,) and SCE'(R,), then TxS exists (in the sense of Def. 
2 ne 


(T'*S)(p) =Sy(n(y) Te(plety))) = Tel(Sy(v(a+y))) , (2) 
=| 


where 7€ D( Ry) is a function such that y(x): 2 a neighbourhood of supp S. 


(b) For every distribution Te D’( Ry), 
a Oe (3) 
Remark 2. The space #’(R,) has been introduced in Subsec. 22.1.5. In particular, 6€#(R,), 
where 6 has the same meaning as in Subsec. 22.1.3. Since supp S is bounded for S€E£’( R,), it 
is easily seen that, independently of 7 ¢€D’(R,), the set (22.3.3/1) is bounded. Thus one can 


form 7*«S=Sx*7’, and all the terms formed in (2) are meaningful. If one sets S,=6 in the 
last term in (2), then one obtains (3). 


Theorem 3. For f(x)cLie°(R,) and g(x) ce Lk°(R,), let supp f and supp g be the 
supports of these functions in the sense of Def. 14.6.4/2. If for every positive number 
N the set 

{(x, y) | (@, y)€ Ron, let yl SN, xesupp f, yesupp g} 
is bounded, then 

Ceo NID NI ay SAT ERT (4) 

‘n ‘2 

exists for almost all xe R,, and h(x) € LP (R,,). Lf f and g are interpreted as distribu- 
tions in D’(R,,), then the convolution fxg exists. It is a regular distribution, and one 
has (fxg)(xr) =h(x) almost everywhere. 


Remark 3. Formula (4) was the starting point for the theory of convolutions of functions. 
Convolutions of distributions have to be regarded as a generalization of this original concept. 
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23. Partial Differential Equations and Distributions 


Boll. Fundamental Solutions 
23.1.1. Basie Properties 


If a.(x) are complex functions that are differentiable on R,, up to arbitrary 
order, and if Uc D’(R,,), then we ask for distributions 7'¢ D’(R,) such that 


Se eer =O . (1) 
{z|=m 
In other words, desired are distribution solutions of partial differential equa- 
tions. 


Lemma. // U and T are regular distributions, and if T has (classical) partial deriv- 
atives up to the order m, then T 7s a classical solution of (1). 


Remark 1. Since 7 =t(x)€C™(£,,), it follows from (1) that U=u(a) €C(R,). A classical solu- 
tion then means that (1) is satisfied point by point. Conversely, every classical solution of (1) 
is also a distribution solution. So the above problem is the generalization of problems like 
those treated in Chap. 19. 


Definition. // a,(x) are complex functions that are differentiable on R, up to arbi- 
trary order, then GE D'(R,) is called a fundamental solution if it satisfies 
Sea) D'G_sé. 
jej=m 


Remark 2. Speaking more strictly, one should say: a fundamental solution with respect to 
the differential expression on the left-hand side of (1). Here 6 is the 6-distribution defined 
in Subsec. 22.1.3. 


Theorem. Let the differential expression >, a,D*T, with constant coefficients aq, 
have a fundamental solution G. aaa 
(a) If for UE D'(R,,) the convolution T=GxU exists, then T is a solution of 


Sica =. 
|2|m 
(b) Jn the class of the distributions that allow a convolution withG, > a,D*T = 
=U has at most one solution. ja| =a 


Remark 3. Convolutions were described in Subsees. 22.3.3. and 22.3.4. From the properties 
stated there it follows that 


> a, DGs U)=( > aD) * U=b*U=0U, 


jalsm lajsm 


which proves (a). The theorem also shows the importance of fundamental solutions. Of 
interest is the proof of the existence of fundamental solutions and their explicit construction. 
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23.1.2. The Laplace Equatiou 


Theorem 1. The regular distribution 


|--$— oes jor nv=2, 
G(x) =: (n — 2) lo,| Ia (1) 
In [zl jo ee 


27 
ws a fundamental solution for A, which means that AG =6. 


2 


n 
Remark 1. Confer Def. 19.2.1. As stated there, A= > agi is the Laplace clifferential oper- 
ator. Here we always assume that »=2. Cee 


Remark 2. We thus can apply Theorem 23.1.1(a). For the following theorem it is of inter- 
est to point out that G belongs not only to D’(R#,) but even to 8’(R,). A polynomial P(«) = 


= > 5,x* is called harmonic if AP(x)=0, Here (as previously) we set 2*=a{! ... ay”. Ex- 
je[=m 

amples of harmonic polynomials are 1, 2), ..., #,, but also aj —x3 for n=2, ete. 

Theorem 2. (a) [f UCE(R,) then UxGeS’(R,) is a solution of A(UxG) =U. Con- 

versely, if TES (R,) is a solution of AT =U, then T can be represented as 


T = UxG + P(x), (2) 


where P(x) is a harmonic polynomial. 
(b) Lf, in addition, U=u(r)€E(R,) is a regular distribution, then (2) can be 
written as 
P=ilx)= f Glx—y) uly) dy+ P(x), (3) 
Ry 


uhere T=t(x) is also a regular distribution. 


Remark 3. Since Uc#(R,), U * GED R,) exists according to Theorem 22.3.4/2. As stated 
in the above theorem, it is possible to prove a much stronger assertion, namely that U « G¢é 
€S"(R,). (2) or (3) then give a complete survey of all solutions of AZ’ =U that belong to 
S‘(R,). (b) says that u(x) €L,(R,) has a bounded support, that the integral in (2) exists for 
almost all 7€ R,, and that i(a) € Elec’ B,) AS’(R,). Compare (2) with the Newton potential 
referred to in Theorem 19,2.5/1. Differentiability assumptions of, say, the form u(x) €C2(R,) 
have now become superfluous. 


23.1.3. The Heat Conduction Equation 


Theorem 1. The regular distribution 


v7) nv \z] 


G(x, t)= 2 eet ig = eee 
O for (20, re h,, ’ 
: : : 0G 
isa fundamental solution of the heat conduction equation, that means a G0. 


Remark 1. One has (a, t)E R14, where s=(KH, ..., tr )ERy and ¢€R,. Further G(z, t)¢ 


n 92 
CR) SCD (Raa). As stated in Chap. 19., A= > me does not contain any deriv- 
kai OLE 


atives with respect to t. Compare G(a, t) with s(x, t) in Lemma 19.4.1. We are now able to 
apply Theorem 23.1.1 (a). 
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Theorem 2. 7f UCE’(R,), and if G(x, t) is the function given in Theorem 1, then 
oT 
P=UxG % a solution of the differential equation ——AT=U. If additionally 


ot 
U=u(x, t) is a regular distribution, then T= T(x, t) is also a regular distribution, 
and 


‘ _le=yP 
ee re ane) 4({—7) 
Digi=ta a = | i ee ents) iit (1) 


ae Ry (t —t) 2 


Remark 2. According to Theorem 22.3.4/2 it is clear that U *G@ exists. The last part of 
the theorem means that for almost all (7, )€R,.; the integral in (1) exists and 7'(x, t)€ 
EL'°(R,.1), provided that u(x, f)€Ly(R, 41) has a bounded support. Moreover (1) shows that 
for the ealeulation of 7'(x, f) one only needs t-values with + =¢. This is physically plausible if 
tis interpreted as the time variable. 


23.1.4. The Wave Equation 


In Sec. 19.3. we discussed the wave equation in 1, 2, and 3 dimensions. Here 
we shall largely confine ourselves to the physically interesting three-dimensional 
case. 


Theorem (a) For x= ie x», Xx)€ Ry and t€ hy, the singular distribution GE D’(R,), 


Ge) =~ = ft foaled ded, peD(R,), (1) 
et 
is a fundamental solution of the three-dimensional wave equation, that means 
0G 
— —AG=6 
él? 


(b) If UED'(R,), with supp Uc {(z, t) | (2, NER, (20) = Bi, then T=UxG@ 


exists and vs a solution of the differential equation ap 


supp Tc Rt. If additionally U =u(.x, t) vs a regular distribution, then T= T(x, t) 
7s also a regular distribution, and 


le { u(y, ¢—lx—yl) ree aie 
ie ja oe by 
0 or LO" 
Remark 1. Formula (1) ean also be written as 
Gp) =— | eee dx, pe DR). (3) 
3 


Hence it follows that supp G={(a, ft) | (x, 1) € R,, £=|a)}. Since this ts a set of the four-dimen- 
sional measure 0, it follows that G in (1) must be singular. In proving that G is a fundamental 
solution, one largely makes use of Remark 22.2 4/5. It is now easily seen that the convolu- 
tion U x Gas per Def. 22.3.3 exists if supp UC R+. Then by Theorem 23.1.1(a) 7 =U * G is 
eff Bs OPN os ee : 
a solution of i —AT=U, where again A7= Ay . (2) largely coincides with the 
a? 
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retarded potentials defined by (19.3.5/1). In contrast to the considerations described there, 
here we do not need any assumptions on the differentiability of u(x, ¢). 


Remark 2. In the same way it is possible to indicate fundamental solutions for the one-dimen- 
sional and the two-dimensional wave equation. We refer to [66]. For the one-dimensional 
wave equation, the regular distribution (Fig. 23.1) 
1 
B(x, o=f2 foie th es |laa 
0 for fs{a|, 
. FG oP 
ve€h,, te hy, is a fundamental solution, i.e., a a2 =° 


} 


Fig. 23.1 


23.2. Initial Value Problems 


23.2.1. Formulation of the Problem 


We already discussed the initial value problem for the wave equation in Sec. 
19.3., and for the heat conduction equation in Subsec. 19.4.3. The problems and 
solutions described there shall now be referred to as “classical” ones. In this 
section we are going to consider generalizations within the theory of distributions. 
We again make use of the terminology of tensor products and convolutions as 
introduced in Sec. 22.3. As we did there, we shall write Up.¢€ D’(R,) instead of 
U )€D'(k,), where a= (2), ..., %,) is the general point in &,. Further, in the follow- 
ing definition, let 6 be the 5-distribution in fy, and let 6’ be its first derivative. 
If¢is the general point in Rj, then we also write 6, instead of 6 (there should be 
no fear of confusion with the terminology used in Subsec. 22.1.3.). By analogy 
we write 6; instead of 5’. If (a, ‘hE Rn.1, c€R,, t€ Ry, then in this sense U)@6’ = 
=U ,x®6; is the tensor product of Uy¢D'(f,) and 6’€D’(R,). Finally, Rt,, and 
——— nr pe 
Ry ., have the same meaning as in Subsec. 19.3.1., and A= » a? which means 

k=| OX; 
that A does not contain any derivatives with respect to é. ; 
Definition. (a) 7f Uy¢D’(R,,), U,eD'(k,), and tf FED'(R,,1) with Sup? Galen 
then UED'(R,4,), with supp Uc nore 
2U 


is savd to be a solution of the initial value problem for the wave equation with the 
right-hand side F and the initial values Uy and BRS <5 
(b) Tf UjpeD'(R,), and tf FED" R,y.4) with supp Fc Rt, ,, then UEeD(R 
with supp Uc Rt, and 
aU 


ap Uae U,@d (2) 


eis 
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as sard to be a solution of the initial value problem for the heat conduction equation 
with the right-hand side F and the initial value U4. 


Remark 1. As was already indicated, (a, #) with x€ R, and ¢€ R, is the gencral point in Heya fie 
Further we have U9 @ 0’ = Up .,@6; €D'(Rn,1) and U,;@Qd= U, 2 ®0,€D"( Rh, .;). Wealso have 
supp (U9@0) C{(x, t) | (x, t)€ Ry, ),t=0}, and analogous relations for U,@d and Uy@0d. 


Remark 2. There arises the problem of how the above definition is related to the ‘classical’ 
Definitions 19.3.1/1 and 19.4.3. 

Theorem. (a) Let U bea solution of the initial value problem for the wave equation as 
per part (a) of the above definition. Let Ug=u(x)EC'(R,), Uy=u,(x)eC(R,), 
P= f(x, NEC (RI,1) and U=ul(a, NEC RT A)N OM Raat) be regular distributions in 
D(R,) and DR, ,1), respectively. Then u(x, t) is a “classical” solution of the initial 
value problem for the wave equation in the sense of Def. 19.3.1/1. 

(b) Let U bea solution of the initial value problem for the heat conduction equation 
as per part (b) of the above definition. Let Uy= ug(x) €C(R,), F = f(a, thE C( Re, and 
O=ulr, t)€ C2 Rz, OC Ri, ,). Then u(x, t) ts a “classical” solution of the initial 
value problem for the heat conduction equation 

Ou 
ot 
ua, 0) —a(4) far. xe R,, 
Remark 3. In the formulation of the above theorem we have allowed ourselves some minor 
incorrectnesses. Instead of F' =f(z, £) eC ,) we should formulate more correctly: let the 
restriction of F to SOUy be a regular distribution in D’ Cos a that belongs to Oe uh ,): The 
wording for U=u/(z, i) shoul be analogous. But since by (nanien we have supp PCRS, 
misinterpretations should hardly be possible. The notations Chea |) ete. were Tareas i 
Subsec. 19.2.1. 
Remark 4. There is a converse to this theorem. For u(x) €C'(R,), u(x) CC(R,) and f(x, t) € 
EC( Hee: let u(x, t) eC (ih) NM ONE) be a classical solution of the initial value problem 
for the wave equation in the sense of Def. 19.3.1/1. If one sets Up =uo(x), U, =uj,(z). 


Fm Gi) sor Gelb, , 120) ies GiGe, )) OP BEE) 5 Fy 
~ \0 for w€Rh,, t<0, ~ 10 Noi? MER, » ERO > 


(7) NG, 1 at) for (az, 1)e Rts 


then U is a solution of the initial value problem as per part (a) of the above definition. For 
ug(x)€C(R,) and f(z, t)€ C(RT |). let u(a, t)€ Oa , Nel sa bea classical solution of the 
initial value problem for the heat conduction equation in the sense of part (b) of the above 
theorem. If Uy, F and U have the same meaning as above, then U is a solution of the initial 
value problem as per part (b) of the above definition. These remarks show that the above 
definition is an elegant generalization of classical initial valuc problems within the theory 
of distributions. 


23.2.2. The Wave Equation 


Theorem. The initial value problem for the three-dimensional wave equation has a 
unique solution in the sense of Def. 23.2.1(a). /f Uy, Uy and F have the meaning 
indicated there, then 


U =Gx(F + Uy@ 6’ + U,@5) ; (1) 
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represents this solution, where G is the fundamental solution stated in Theorem 
23.1.4. 1f Uj= u(x), Uy =uy(x) and F = f(x, t) are regular distributions in D’( Rs) 
and D’(R,), respectively (with supp Fc Rf), then (1) can be represented as 


Oe Ca ee) 
~ aa | lv— yl ges fe us(y) dey (2) 
jz—yl|st lx-y|=t 
i @ 
ahr i ) a, 
y—-al= 


Here x(t) =1 for t20 and y(t) =0 for t<0. If u(x) €C3( Rs), uy(x) € O2( Rs), and if 


f(x, )EC?( RZ), then the regular distribution U =u(c, t) is a classical solution of the 
tnitial value problem for the wave equation in the sense of Def. 19.3.1/1. 


Remark 1. The last part of the theorem again is to be understood in the sense of Remarks 
23.2.1/3 and 23.2.1/4. 


Remark 2. Since supp (F + Up@6’+ U,@6)C lig it follows from Theorem 23.1.4 that (1) is 
a solution of the problem. The derivative at in (2) is to be understood as a distribution 


derivative; thus it is not asserted that the right-hand side of (2) is a regular distribution (in 
contrast to the last part of the theorem, where rather strong assumptions are made on Ug, 
u, and f). 

Remark 3. The last part of the theorem is identical with Theorem 19.3.5/2. Thus in partic- 


ular U=u(a, t)€ CRF) in the sense of Remark 1. So the above theorem is an essential 
generalization of the previous classical considerations presented in Chap. 19. 


Remark 4. The solution (1) continuously depends on the initial data in the following sense, 
For [Ui 4c DR), ean CD (Ry) and {FP }"_,CD(Ry44), with supp Fé c aS 
How (=i, &., 8%, soon, Wat 

Vig) + Uolg),’ Ug) + Uy) for peD(R,) . 

Fy) +F(p) for ye D(Ry+)) and koe. 


U, Uo, Uy and F have the previous meaning. If U* is the solution (1) with F*, Uk and U4 
instead of F, Uy and U,, then U*(y) > U(y) ask ~, where ye D(R,,4). Stability propositions 
of this kind are physically meaningful: small perturbations of the initial data cause only 
small perturbations of the solutions. 


Remark 5. Analogous theorems and remarks can be presented for the one- and two-dimen- 
sional wave equation; ef. Par. 15 in [66]. In the case of the homogeneous wave equation 
(1.e., with # =0) one then obtains generalizations of Theorems 19.3.2/2 and 19.3.3/2. 


23.2.3. The Heat Conduction Equation 


Theorem. Jf Upc E(R,), FEL (K,,,), and if supp Fc Re,,, then 
U =Gx(F + U9@8) (1) 


us a solution of the initial value problem for the (n-dimensional) heat conduction 
equation as per Def. 23.2.1(b). Here G is the fundamental solution given in Theorem 


23.1.3/1. If Uyp=uo(x)€E(R,) and P=f(x, )€E'( Ry), with supp Lehi a, are 
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regular distr — then U=u(x, t) tn (1) ts also a regular distribution, and 


_(e=aF ZA 
x) e M-* ae ( -a3 
u(x, )=—— fly, t) dy dr OA) e  “ aly) dy. (2) 
Be (¢— 1)? Ore? 


' 
Ry 


Here x(t) has the same meaning as in Theorem 23.2.2. 


Remark 1. One has supp (F +U,@6)c Re and F + U)@6€E(R,,, 1). Theorem 23.1.3/2 
now shows that U in (1) isa solution of the Tobie {itis readily verified that supp Uc Re), 
The first summand in (2) is identical with (23.1.3/1) (apart from the modified notation). The 
second summand in (2) is analogous to a 4.3/1). A comparison with Theorem 19.4.3 shows 
that here uo(x) = g(x) €L,(R,) is generally not bounded but, on the other hand, has a bounded 
support. 


Remark 2. In contrast to Theorem 23.2.2 the above theorem does not contain a uniqueness 
proposition. The solution of the initial value problem for the heat conduction equation is 
not unique (cf. [18], Chap. I, Par. 9). Uniqueness can however be enforced by. making addi- 
tional demands upon the initial data and the solutions (limitations of growth as |x| > and 
t+c). Theorem 19.4.3 represents an example of this. 


Remark 3. Stronger requirements for the regular distributions uo(2) and f(z, t) ensure that 
u(z, t) is a classical solution. Further, by analogy with Remark 23.2.2/4, it can be shown that 
(1) depends continuously on the data Up and F. 


24. Fundamental Concepts of Classical Field Theory 


Ame Tensors 


24.1.1. Introductory Remark 


The axiom of classical point mechanics stated in Subsec. 12.2.1. is a variational 
principle. A problem of mechanics was assigned a Lagrangian function L(é, x;,(t), 
2,(t)), and we asked for extremals of the corresponding variational problem in the 
sense of Def. 11.1.3. This led to the Euler-Lagrange equations stated in Theorem 
11.1.3/1. The solutions of this system of ordinary differcntial equations (to which 
initial conditions had been added) then yielded the trajectories of the mechanical 
problem. Thus the idea to describe the dynamics of natural processes by varia- 
tional principles has proved highly successful in mechanics. Therefore it suggests 
itself to ask whether this principle can be extended to other areas of physics. 
We had made some remarks on this subject in Subsec. 11.2.1. In particular we 
had mentioned that the theory developed in Chap. 11 (calculus of variations) 
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will not suffice for such an intent. On this basis it is only possible to develop theo- 
ries whose fundamental equations are ordinary differential equations. But many 
theories of physics are governed by partial differential equations. If we want to 
keep to the lme of obtaining these partial differential equations as a result of 
variational principles, then the basic ideas of the calculus of variations as present- 
ed in Chap. 11 must be decisively generalized. Here first of all it is our aim to 
present the fundamental ideas of the special theory of relativity, including 
electrodynamics. This is a so-called absolute space-time theory: the geometry of 
space is given (the four-dimensional Minkowskian space), and physical processes 
are described in this four-space. On the other hand, in general space-time theo- 
ries such as the general theory of relativity, physical processes will (at least 
partially) determine the geometry of space. These few indications already sug- 
gest that the set of mathematical tools of general space-time theories will be 
much more complex than that of absolute space-time theories. For the presenta- 
tion of the general theory of relativity we will, later on, need the theory of C~- 
manifolds as a basis. Starting from this point, we shall consider tensors and ot her 
geomctric objects on manifolds (Chap. 29). For the moment, however, we can 
permit ourselves the luxury to postpone this rather complex theory to a later 
time. For the special theory of relativity and the electrodynamics associated 
with it, it will suffice for the present to consider tensor fields that are defined 
over a fixed domain in Ry. This will essentially facilitate the understanding of the 
basic ideas of this theory and of its interpretations. The purely mathematical 
aspect of this matter is not essentially different in these two cases: it is tanta- 
mount to a more or less masterly application of the chain rule of the differential 
calculus. In other words, in this chapter we reduce the inathematical foundations 
to the minimum required here and in the following Chap. 25. Those who want to 
know more must also invest more: for understanding the general theory of rela- 
tivity to be presented in Chaps. 30 and 31 it will be necessary to study Chap. 29; 
Chap. 32 will be required for understanding Chap. 33, but for understanding 
Chap. 25 it will just suffice to read Chap. 24. 


24.1.2. The Fundamental Tensor 


Euclidean line element: If one considers the continuously differentiable curve 
x(t)=(x'(t), ..., 2%(¢)) in Ry, then, according to Subsec. 9.2.3., 
t 
iain nee o oe a2" 
s(t) = | V(@"c))24...4 ("%(x))2 dz, ae (1) 
if id 
is the arc length. Within the theory we are going to develop here, it is expedient 
(and common practice) to use superscripts for the co-ordinates, i.e., x=(xl, ..., 
£")€ Ry. From (1) it follows that 


ds \2 n n ¥ 
(zr) = > (PSD BO) HW). (2) 


R= k= 
It is usual (and, as we shall see, highly expedient) to write 


nh 
ds? “4 dada’ = datdc* ce) 


24.7.2. 24.1. Tensors 241 


instead of (2). In the last expression we have used Einstein’s summation con- 
vention: take the sum over all doubly occurring indices, where from one case to 
another it is clear in which limits the sum has to be taken (in our case over 
a Se ee : : ranean 

eee Thus (3) is an abbreviating notation for (2), which indicates that 
the parameter ¢ is of no interest. Besides the Cartesian co-ordinates Ph sony Be? 1M 
FR, (or in a domain in Rp) we now consider curvilinear co-ordinates FED es Fe? 
the sense of Subsec. 8.2.2., ie. x =2* (x!) and a! = a(x), where from now on we 
shall always assume that all co-ordinate transformations of this kind are differen- 
tiable up to arbitrary order (Fig. 24.1). We denote the functional determinant 


, Ces “oy, ..., x* _ O(a! 
(or Jacobian) by Aah ee , requiring that Peta (in &, or in the 


domain where the co-ordinate transformation is considered). If we now change 
tye < 3 
over to x in (3), then we obtain 


% Yo , € ks eo = 
adc2— gridx Kx l ’ where gki= aan . i : a 


Fig. 24.1 


Here we have to take account of the above summation convention. It is easily 
seen that 
O(a) 
det (gr) =| ——— 
on) = (oer 
Here det(g;1) is the determinant of the positive definite matrix (gy1)¢;-,. Hence 
(git)e,1=1 iS a positive definite, symmetric matrix. If one fixes a point P~ x, then 
it is possible to find a non-singular matrix A = A(P) = (api); 7-, such that 
1 0 
Aiea =| °° = (identity matrix) , (6) 
0 1 
where A‘ stands for the conjugate matrix of A. We point out that this a point- 
wise property: A varies from one point to another. Thus (4) is the Euclidean line 
element ds? in general curvilinear co-ordinates. 


\'+0 and gei=gu- (5) 


The line element ds?: As usual in mathematies, one considers an approved theory, 
reformulates it, and looks for meaningful generalizations. Concepts that had been 
derived previously, and hence were results, are now axiomatically taken as a 
starting point to develop an extended theory. So we shall do here, too. Let 2 bea 
domain in R, (later on Q= Ry, and especially 2= R,, will be of interest). Given a 
symmetrical matrix (gxi)é2-4) 1€-, Get =Giks where gpi(c) are real-valued functions 
differentiable on Q up to arbitrary order. Further let det (gaz) +0 on 2. We call 
(git)e.-, the fundamental tensor, which is used to form the line element 


dst=gude*dr',  gei=geilr) 5 4 


where again the summation convention has to be applied. Here x= (.', ..., x”) are 
Cartesian co-ordinates. The decisive alteration as compared with the Euclidean 
line element is that (gg)g 7-1 is not necessarily positive definite. For a fixed réQ it 
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is possible, by analogy with (6), to obtain the normal form 
1 0 
Agni A={ 1-1 (8) 
0 ‘ad 


Here the non-singular matrix A = A(x) depends on x, but not the number of +1 
and —1 diagonal elements, respectively. 


Examples. (1) If Q= Ry and if gus=dg1, 80 that (gxu)~7_, is the identity matrix, 
then ds?=dz*dz* is the Euclidean line element. (2) If Q= fi, and if 


1 1 0) 
md=(, i ai (9) 


then ds? = (dx!)? + (da)? + (da)? — (dz)? is the Lorentz metric, with det (gz:) = — 1. 
Later on this metric will play a decisive role. Sometimes the line element (7) is 
also called a metric, but this has nothing to do with the metric of a metric space 
as described in Subsec. 1.3.3. 


Transformation of the fundamental tensor: As in the case of the Euclidean line 

element, besides the Cartesian co-ordinates z!, ..., x” in Q we again consider arbi- 

trary curvilinear co-ordinates 2’!, ..., 2", where again the previous conditions 

shall hold. Conversion of ds? in (7) to the curvilinear co-ordinates gives 
CxP — axt 

de =o da “de eviereaa,, = 4 10 

Get g Inq ee pea: ( ) 

Lemma: [7 (x'*) and (x""") are two curvilinear co-ordinate systems in Q,and 1} (gi) 
and (gz) are the corresponding fundamental tensors in the sense of (10), then 

f , fy. 

Cin ada? i Ee m) 


ih 


Gut = Goa 5 TT” BT? et (ji) = | det (gj) - (11) 


ale’) 


Remark 1. Formula (11) is easily deduced from (10). The first formula in (11) is called the 
transformation law of a covariant tensor of order two. The second formula in (10) is 
a special case. From (11) it is already observed that the Cartesian co-ordinates no longer 
play a distinguished role. Still they are primus inter pares, but within the more genera] 
theory to be presented in Chap. 29 they will loose this position, too. 


Remark 2. From (1) it follows that g/;=gi;, which means that the symmetry is invariant 
under co-ordinate transformations. 


24.1.3. Tensors 
As usual, let 2 be a fixed domain in Ry. For every system (x!) of curvilinear 
co-ordinates in Q, consider an n*-tuple 
EE NAC WME. on Ih aS (1) 


of real-valued functions that are differentiable up to arbitrary order. Here 
L, 5 M, DP, «+, Gare k indices in total, which run from 1 to x independently of one 
another. 


¢ 9 
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SS es eee 


Definition. (1) is called a tensor field of order k if, for every two systems (x'’) and 
(x'’*) of curvilinear co-ordinates in Q, 
Oxr"% = Ax" Ba" — Ay"? 


MT) SUN ae OP xd fF pelt ee a » 
3 — cout )= Beale) Ox’t Oy! Ax"'6 Ayla 2) 
contra- co- 


variant variant 

Remark 1. We have to sum up over doubly occurring indices from 1 to n. The structure 
of (2) is clear: a superscript on the left-hand side that refers to 2” is also a superscript on 
the right-hand side (for example a). An analogous statement can be made for subscripts. 
The summation is always carried out over opposite indices, i.e., a superscript and a sub- 
script cach. (2) has to be understood as an identity expressed either in 2’ or in x” co-ordi- 
nates. The structure of the formula can almost be kept in mind with the aid of the above 
remarks. The superscripts are called contravariant, the subscripts covariant. 


Remark 2. It is essential that every curvilinear co-ordinate system in Q is assigned a sepa- 
rate n*-tuple of C~-functions. (2) interrelates these separate n*-tuples, which immediately 
raises the question whether non-trivial tensor fields exist at all. If one considers a fixed 
co-ordinate system in Q (e.g., the Cartesian co-ordinates z!, ...,2”) and an arbitrary n*-tuple 
T’--*, ,(z) of C*-functions defined on Q, and sets 


ox!@ ou'® Ox" Ox? 
Oe da ee a 2 
then it is easily verified that in this way a tensor field of order k is obtained (Lemma 24.1.2 


is an example for this assertion). Hence there are tensor fields as numberless as the sand 
of the sea. 


i ae cra) = ee) 


Remark 3 (examples). (1) Lemma 24.1.2 now shows that g,; (from now on we shall omit 

the parentheses) is a covariant tensor field of order two. (2) It will be useful to regard 

(scalar) functions as tensor fields of order zero, that means f’(a’*) =/”(x'""). (3) If f'(a’") is a 
of see 

function on Q, then we set /7;(x’7) Sus (x’") for the partial derivatives, and analogously 


yy 
af” Ox 


=a This means that the prime ,.’“ indicates the partial derivative with respect 
Hs : 


to the following indices. This convention also holds for derivatives of higher orders, e.g., 
of” 


for fie =D gg TE * Now, if /’(z’7) =/’’(x’"’) is a scalar (a tensor field of order zero), then 
Dou 


it follows from 
, dx"! @ 
eae ane : (3) 


that f/,(z’7) is a covariant tensor field of order one (also called a covariant vector field). 
Hence the transformation laws for covariant vector ficlds aj and for contravariant vector 
fields a’* read, respectively, 


tl r) ttk 
ka a 
tt t : th 42 
ap =A] ar" aul = @a Sea tan (4) 


Mnemonie remark (or simply, asses’ bridge). From the chain rule it follows that 


Rik 
ae alee (5) 
Hence the transformation laws for contravariant and covariant vector fields can 
be derived by using the chain rule to convert dx* (contravariant, since k isa 
superscript) into dx and f;; (covariant, since k is a subscript) into f7, which 
is done according to (3) and (5). The general tensor fields arc then built up accord- 
ing to the above principles. 


dz ™ 


244 24. Classical Field Theory 24.1.5. 


Terminological remark. Sometimes we simply speak of tensors (and vectors) 
instead of tensor fields (and vector fields). Usually the n*-tuples TUE oan) 
at afixed point P€2 are called tensors, provided that transformation laws of the 
above form are applicable. But here we exclusively consider tensor fields, so that 
confusions can only occur if we definitely set our mind on them. 


24.1.4. Properties of Tensors 


As was just pointed out, here we only consider tensor fields defined on the do- 
main Qin #&,, but sometimes we choose to simply speak of tensors (and vectors). 


Theorem lt. (a) [f Tey, and Sky» are tensors with the same index pattern 
(z.e., tensors of the same order which have equal numbers of covuriant and contravari- 
ant indices, respectively), and if A and wu are real numbers, then 


Re Gen (1) 


us a tensor of the same type. 
(b) Lf TR) and St". a are any two tensors (whose index patterns are not 


necessarily tdentical), then 
To = Uk.la.b eg (2) 


is also a tensor. 
(¢) (Contraction). If Thy. y..» ts a tensor, then Te-4-l, 4 4 ts also a tensor 
(summation over a). 


Remark 1. A tensor is of type (Ay, ky) if it has £4, contravariant and ky) covariant indices, 
where 4,=0. 1, 2... and ky=0, 1, 2, ... Here k=k,+k is the order of the tensor. Thus in 
part (a) 7 and S as well as the tensor in (1) are of type (dy, ky). If in part (b) 7 is of type 
(ky, ky) and S is of type (hj, k4), then U in (2) is of type (ki +h’, ky +k). If in part (c) 


Phetul yy 8 Of type (ky, ko), where ky=1 and ky=1, then 7-4-1, is of type 
(k,-1, ky — it). 

Remark 2. If 7"; is a tensor of type (1,1), then 7; is a scalar. This is an example for 
part (c) of the above theorem. 


Theorem 2. (a) 4" ={5 jor : =, us a tensor of type (1, 1) in all curvilinear co-ordi- 
nate systems. ie: : 

(b) Lf Ty ts a covariant vector, then Tyi— Ty is a tensor of type (0, 2) (curl, or 
rotation, of T';). 

(¢) Lf gut ts the fundamental tensor, and if g is determined from the systems of 
equations gg, =O", then g* is a symmetrical tensor of type (2, 0) (the contravari- 
ant version of the fundamental tensor). 

Remark 3. If we fix k=1, ..., » and let m vary from 1 to n, then each of the linear systems 
of equations g!'g7m_=6*m (summation over 1) has a unique solution, because det(gjm) +0. 
The assertions of the theorem can be proved in a rather simple way, but are not trivial. 


24.1.5. Metric Geodesies 


Metric geodesics are of essential importance in the general theory of relativity: 
test particles and electromagnetic waves move along metric geodesics in a four- 
dimensional curved space-time that is pointwise a Lorentz metric. Forthe moment, 
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ee ee esos aD 
however, we confine ourselves to the pointwise Euclidean case we are able to 
discuss on the basis of the tools we have developed so far. Again let Q be a do- 
main in R,, and let the fundameutal tensor referred to in Subsec. 24.1.2. be posi- 
tive definite. This means that pointwise it is possible to transform gx into the iden- 
tity matrix, as was done in (24.1.2/6). Then ds? =gy,da*dz" is the line element in Q 
(where we need not necessarily assume that x'!, ..., x* are Cartesian co-ordinates 
ime) elected) =a (mee a (7)) be a continuously differentiable curve in Q (Fig. 
24.2). By analogy with the Euclidean length measurement described in Subsecs. 
9.2.3.,ad 117.2°3. are set 


A eeneaaae 
s= L (xr) =f Vonrth ae! dt. (1) 
a) 
Since gx = gxi(x) is positive definite, one has gut®t! =caé*e*, where c>0. Hence (1) 
is meaningful. We assume that £*4*—0. 


Fig. 24.2 
Definition 1. Curves of minimum length s in the sense of (1) which pass through two 
given points A and Bin Q are called metric geodesics (Fig. 24.2). 


Remark 1. This is a variational problem in the sense of Def. 11.1.3(b), with Lit, a*, é*) = 


=V9,)¢'a!; see also Subsec. 11.2.3. and the remarks we made there. s is again called the 
(arc) length of the curve considered. 


Definition 2. Christoffel’s symbols of the first kind are defined as 


1 
Gk, => (give + Gevi— Qtr) » (2) 
and Christoffel’s symbols of the second kind are defined as 
iy) =gi" (kl, 7} . (3) 


Remark 2. Of course the usual summation conventions are applicable to (2) and (3). 


Theorem. x(s) = (x!(s),...., 2(s)) ts a metric geodesic if and only if 
Deak a l dx” 
ae lil ae ae (4) 
ds? Um) ds_— ds 
where s is the lenght of curve (arc length) defined by (1). 
Remark 3. Formula (4) is obtained from the Euler-Lagrange differential equations (11.1.3/2) 


. da* da! 
if it is taken into account that s is the length of curve, i.e., that gy; es 


Remark 4. In the (global) Euclidean case the theorem coincides with Theorem 11.2.3. 
But the decisive advantage of the above considerations is that we obtain a description of 
metric geodesics which is independent of the curvilinear co-ordinates chosen. If (x*) and 
(a’*) are curvilinear co-ordinates in 2, and if g,; and g;; are the corresponding versions of the 
fundamental tensor, then 
Ea ae. dn’* de’! 
r) —— —-=9)(4") —— ——. 
Gale) ap ap Ile) a 
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Hence the formulation of the problem is invariant under co-ordinate transformation. and 
so is the result (4). The equations (4) are of essential importance in the general theory of 
relativity, but there the fundamental tensor g,; is pointwise a Lorentz metric. 


24.2. Classical Field Theory 


24.2.1. The Model of Field Theory 


In Sec. 12.1. we discussed the subject of modelling in physics. Examples of 
this conception were the classical point mechanics presented in Sec. 12.2. and the 
hydrodynamics described in Sec. 16.1. The pretension of field theory goes farther: 
it is the basis of a good many theories that are described by ordinary and partial 
differential equations. In spite of this generality, the conception described in 
Sec. 12.1. can be reified, especially the black box of a “mathematical theory” 
given in Subsec. 12.1.2. Figure 12.1 now looks as follows (Fig. 24.3). The mathe- 
matical theory consists of a C~-manifold (in the case considered so far it was a 
domain 2 in Ry), a fundamental tensor g,;, and systems of ordinary and partial 
differential equations for tensors and other (invariant) geometric objects. There 
is an essential difference between absolute and general space-time theories. In 
absolute space-time theories, 2 and the geometry, in the form of the fundamen- 
tal tensor gj, are given. (Examples are the Euclidean space Ry, for an n-particle 
system of classical point mechanics, or the four-dimensional Minkowskian space 
of special relativity.) Dynamic objects which must be translated from the 
“physical field” into the “mathematical theory” are then tensors and related 


geometric entities. Among the arrows > in the above figure, it is only v4 which 


is realized here. In general space-time theories the situation is different. Here 
the geomctry, in the form of the fundamental tensor gj; is itself a dynamic ob- 
ject. This means that the geometry is determined only by the given concrete phys- 
ical problem. This characteristic feature of general relativity will be discussed 
in greater detail later on. As was already stated in Subsec. 24.1.1., for the mo- 
ment we confine ourselves to absolute space-time theories. 


translation 


ord, part. phys. Tield 
differential 


eqs. for tensors 


inter — 


pretation comparison 


Fig. 24.3 
24.2.2. Lagrange Densities 


As previously, let Q be an arbitrary domain in R,, and let git be a fundamental 
tensor in 2 in the sense of Subsec. 24.1.2. (Fig. 24.4). If a’* and a’ are arbitrary 
curvilinear co-ordinates in Q, then ’(t) = (x/I(z), ..., a(t) =(2’M(x), ..., a(t) = 
=x""(t) 18 a curve in Q, where t is a real parameter. As we did already in Sec. 
24.1., from now on we shall tacitly assume that co-ordinate transformations, 


ws 
the 
vo 
wD 
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Lo 
ra 
~I] 


Q 


curves etc. are sufficiently smooth, e.g., that they are C»-functions. As in Sub- 
sec, 24.1.2. we admit only co-ordinate transformations x" =.2'%(x'!) sueh that 


Fig. 24.4 


ee) 
See 
e eye th da * ° . 
Definition. (a) LZ [2x'*(z), ee. (t)] as called Lagrange density for curves vf, for 
every two systems of curvilinear co-ordinates x"* and x"*, 
da che | 
i ee i la he), = (r))). 
(2), Se) =z («"*e), Fo) ) 


r r 
(b) L(x®, Te) gy Thy g's) ts called Lagrange density for the tensors 


: 
tie . . a . a 
Tel, g with r=1,..., R, tf, for every two systems of curvilinear co-ordinates x'® 
tse 
and x'"*, 


Y 


r r r5) gl ba 
( ) rath, TE oa DRM acaeg) (2) 


L(a'*, ee ees cas) ~ A(x") 


Remark 1. As in Def. 11.1.3 we demand that the functions L=L(u!,...,u%) are twice 
continuously differentiable with respect to all their arguments. But it is useful (and suffi- 
cient) to require this differentiability not on the total #, but only on certain domains 
(which depend on the formulation of the problem). As regards this, confer the examples 
in See. 11.2., e.g., Remark 11.2.3/1. Of course this involves certain restrictions with respect 
to the curves and tensors admitted. 


ie 
Remark 2. In part (b) of the definition we consider # tensors, 7’, which in general have 
, 3) yr 


ie 
different index patterns. As previously one has 7’---.“s ‘ 


= ae! 
Examples. We consider the Minkowskian space. This means that Q=R,, and 
ute =(0 14° @) 
5 = o 
P=) 


is the fundamental tensor in Cartesian co-ordinates, x= (x1, x2, x, 24). One has 
g=det gu= —1. If x’* is an arbitrary curvilinear co-ordinate system, then it 
follows from Lemma 24.1.2 that g’ =det g;,<0. (According to the transformation 
law (24.1.2/11), gj; is known in arbitrary co-ordinate systems 2 if gg: is given in 
the co-ordinate system x*.) If we consider the curve x(t) = (x!(z), x*(r), x(x), a4(z)) 
in the Minkowskian space (with Cartesian co-ordinates), then 


ds\2 /dx!\2  fdx2 a dx3\2 fda’ \2 
Galle sae Ge Pe 
Later on (x!, x2, x?) will be the position (of a particle), and x4=¢ the time. In this 


: : ee os \2 
sense a curve x(t) is called time-like if Gc <0 for all parameter values t 
T 


ise? 
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ds \? : 
(which are admitted to competition). {If always al >O0, then the curve is 
Tv 


called space-like.} Geometrically this means: if (€!—a!(z))24 (€2—22(r))24+ 


+ (53 — x3(r))2 = (64 — x4(z))? is the right circular cone in R, with a(t) as a singular 


d da! dr? 
point, then the curve x(z) is time-like if and only if the tangent aia A ie ; 
dx3 dx’ dr dz Sar 


= always points to the interior of this cone (Fig. 24.5). To give an ex- 


dr’ dr 


Fig. 24.5 


ample of a Lagrange density for curves, we confine ourselves to time-like curves, 
x(t), following the lines of Remark 1. (For the later applications this will be 


sufficient, since physical processes are described in the Minkowskian space by 


ds \? 
time-like lines, or lines for which i) =0.) Now it can be readily verified that 
T 
cle da tela 
ak So — _ f Sceerentaeeeeentn cent ed 
L(#@), 9) $i (4) 


is a Lagrange density for time-like curves. Here x’* is an arbitrary curvilinear co- 
ordinate system in the Minkowskian space.. To give an example of a Lagrange 
density for tensors, we first of all recall Lemma 24.1.2. If «’* and a’* are any two 
curvilinear co-ordinate systems in the Minkowskian space, and if g’ =det gx, and 


yk 
g’ =det gz, then it follows from Lemma 24.1.2 and from n= 0 that 
pu 
7 _Ox"*) ; 5 
=, a(x’) V =) oe (d) 
Now, if £ is a scalar function such that 
ry tk es ee Fy tlh a Oe 
L(x T aoe T Dl Sie k T esas T fl ata) (6) 


for arbitrary co-ordinate systems x”* and x’’*, then it follows from (5) and (6) that 


r pp e857 G r 
L(x", Dh LD Neel ara) = y-¢ L(x’, ee, Rho ais) (7) 
is a Lagrange density for the tensors 7’. A typical example is 
L=)—g9G"'Ay, , (8) 


where G"’ is a tensor of type (2, 0) and Hy, is a tensor of type (0, 2). (We recall 
that we have to sum up over doubly occurring indices; G“'H,; is a scalar after 
the double contraction procedure.) 
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Remark 3. To avoid misunderstandings, we point out that (1) and (2) are to be understood 
as identities. With the use of the chain rule and of the transformation law for tensors, one 
has to convert both sides of (1) and (2) to a fixed co-ordinate system (e.g., to 7"), Thereafter 
(1) and (2) must be identities for all (admitted) curves or tensors. The above examples 
how clearly how this is to be understood. 


24.2.3. Lagrangian Formalism 


Curves: Wetrea)— (riz), ae(7)), Tj =t=t,, be a curve in the domain Q in Ry. 
As usual, let x*(r) be smooth (e.g., twice continuously differentiable) functions. 
Let ¢(t)=(¢'(z), ..., o™(z)), with ¢(ro)=(r,)=0, such that x(t) +egp(t) are 
smooth (twice continuously differentiable) curves in Q for O<e=1 (Fig. 24.6). 
If Lis a Lagrange density for curves, then we ask for curves x(r) such that 


da* 
| L (40, 3: in) dt 1) 


To 


becomes extremal (or stationary). (1) is independent of the particular choice of 
the (curvilinear) co-ordinates x*. By analogy with Subsec. 11.1.3., this formula- 
tion of the problem means that, for all y(t) with the above properties, 

d 

qe if) == (2) 

E 
where 
ty 


; d 
ie | L( a) ere), Te [24(r) + op4(7}) dz 
to 
Apart from the requirement (24.2.2/1), this is the variational problem with fixed 
boundary values as per Def. 11.1.3(b). 


82 
aI, 


X(T) 


Fig. 24.6 Fig. 24.7 


Tensors: Given a domain 2 and a Lagrange density L for tensors in the sense of 


: 
Def. 24.2.2(b). Desired are tensors 7*--4,_, such that 
if ae 

ea eet pear) x (3) 

@ 
becomes extremal (or stationary) (as referred to the Cartesian co-ordinates ZF) 
for every bounded domain » with @ c 2 (Fig. 24.7). The formulation of the problem 
is meaningful (i.e., independent of the particular choice of the co-ordinate Sys- 
tems): if x’* and x’’* are any two systems of curvilinear co-ordinates im Q, and if 
w’ and w’’ are the images of » obtained by application of a°’* (xr!) and 2x" By), re- 
spectively (which are interpreted here as mappings of 2 in Ay), then it follows 
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from Theorem 9.2.2 and from (24.2.2/2) that 


~ r r 
[ee Pin, (A) da’ 


. r r Ola’? O(x’’¢ 
= f a seh xe) a : aa dx’ 


Ae)" ae 


This means that the problem ean also be formulated without reference to 
Cartesian co-ordinates, in a way that is invariant under co-ordinate transforma- 
tion. The precise formulation of the above requirement now reads as follows. 


: 
If O=e=1, and if V*-,, ave tensors which have the same index pattern as 
ih 


(eet tien let ‘ 
: 


ie r r 
T,(e) = if L(a#, Gel eee. Pe ey Pee ae) dx. 
aw 


Tr 
Desired are tensors T*-!,_, such that 
ier her0 (4) 


r r 
for all bounded domains » with @cQ and all tensors V with V4, ,€ D(a). 


Theorem. (a) (Curves). The curve x(t) is extremal (stationary) in the sense set 
out above if and only tf, for k=1, ..., n, 


Cy an ae e) Dy 
ae Ga)- ©) 
(Euler-Lagrange equations). r 


(b) (Tensors). The tensors (dynamic objects) T-. are extremal (stationary) 
af and only af, for r=1,..., R, 


L OL 

= (6) 
up r 

OTKLy g NOTKLy gl: 


& 


Remark, Part (a) is identical with Theorem 11.1.3/1. In Remark 11.1.3/2 it was explained 
how (5) is to be understood. One obtains a system of n ordinary differential equations. In 
the same way one has to interpret (6), which now gives a system of partial differential 
equations. Here the sum is taken over s, while &, ..., 1, p, ...4.q vary independently from 
1 ton. Further, r=1, ..., R. Thus we have reached one of the goals stated in Subsec. 24.1.1.: 
the derivation of systems of partial differential equations from variational principles. Thus 
for a concrete physical problem one has to find a Lagrange density L such that (6) is 
identical with the corresponding system of partial differential equations. 
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24.3. Examples of Field Theories 
24.3.1. Covariant Point Mechanics 


-\s in Chap. 12, we consider a system of x mass points in Rs, with the masses 
mm, where d= 1,8) The investigations carried out in Subsec. 12.3.1. can 
now be described as follows, If (.°"~?(z), x"~"(z), x°"(z)) is the trajectory of the 
r-th particle in R3, described in Cartesian co-ordinates, then we set, in the sense 
of See. 24.2., 

1 
Q= Rorvt, tan) = (5 “uk (1) 
Here ¢=.2°"*1(z) is the time. Now the Lagrangian function (= Lagrange density 
for curves) stated in (12.3.1/1) can easily be rewritten to fit into our present 
calculus (see below). Here the potential energy V = V(z*) is a scalar function, 
which is now required to be independent of ¢. Then (24.2.3/5) is identical with 
Theorem 12.3.1. The preceding sections enable us to formulate the same problem 
in arbitrary curvilinear coordinates. Let again n mass points be given, then we 
describe the position of the r-th particle by separate curvilinear coordinates in 
f;, which are again denoted by (x"*~?, #*"—!, x"). It is easy to imagine problems 
for which certain curvilinear co-ordinates are much more suitable than Cartesian 
co-ordinates, which take no account of the specific character of the problem. 
If, for example, it is known a priori that a mass point will always move on a 
sphere, then it is natural to use spherical polar co-ordinates with the origin at 
the centre of the corresponding ball. (1) must now be replaced by 
(Gs) 0 
Oe Tie ) Gi (@s) cane . (2) 


0 (As) 
Here (G3) are 3 3-matrices, where the r-th matrix (corresponding to the r-th 
particle) takes the form 


gu=— gap), k=3(r—-l)+a, 1=3(r—1)+8. (3) 
gq 


Here ~, 8 and y can assume the values 1, 2, 3, and r=—1, ..., x. Further, (Ga,p)2,p=1 
are positive definite, symmetrical matrices which belong to three-dimensional 
line elements in the Euclidean space 3, as referred to the curvilinear co-ordi- 
nates (#"~?, 2"—1, x"), Examples are Cartesian co-ordinates or polar co-ordi- 
nates, where 2 has to be suitably chosen. The (37-+ 1)-th co-ordinate 2+! in 
(2) is again the time. 
dz* dz! 

L= ni An Ape 
is a Lagrange density for curves, provided that V(:*) is a scalar function (tensor 
of order zero). 


V(x*) (4) 


Theorem. /f V does not explicitly depend on t=x*!, then the Huler-Lagrange 
equations (24.2.3/5) for the Lagrange density L in (4) read 
1 OV 
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dz! : 5 
Remark 1. One has =. The requireinent that V does not explicitly depend on a7 +1 
Tt 


: y 
means that we consider a conservative system in the sense set out in Subsec. 12.3.3. ea 
. a2 
are the Christoffel symbols of the second kind as per Def. 24.1.5/2. From qa” it follows 
that we can set ¢=1 without loss of generality. 


Remark 2. The interpretation of'(5) is clear from the way we proceeded so far: with t=, 
(x3"—*(t), a'r l(t), v(t) is the trajectory of the r-th particle, with the time as a parameter. 
With given initial data x*(to) and 2*(to), the system of equations (5) has (at least locally) a 
unique solution. If V(x’) =0, then it follows from Theorem 12.2.3 that L is constant along 
a trajectory. The constant is positive. Hence, except for a positive constant, ¢ is identical] 
with the are length s in Subsec. 24.1.5. For V(a*) =0, (5) is thus identical with (24.1.5/4). 


Consequently, a force-free system, being characterized by V(a*)=0, moves 
along a metric geodesic in Ry. (6) 


We will find this proposition again in the general theory of relativity. It is of special interest 
(here as well as later on) that this proposition is independent on the particular choice of 
curvilinear co-ordinates. 


Remark 8. According to Newton’s axioms, 


In an inertial system, a mass point under no forces 
moves uniformly in a straight line (Lex prima). (7) 


(5) shows that there are two kinds of forces that counteract a uniform motion in a straight 


eee 
of physical origin (gravitational forces, electromagnetic forces etc.), and its existence has 
nothing to do with the specific character of the chosen co-ordinates 2*. On the other hand 
the ficticious force has no physical cause, it is, so to speak, a measure of the deviation of the 
co-ordinate system chosen from an inertial system. It is however possible to regard (6) as 
an elegant gencralization of (7). 


1 OV 
line, #/=0: the real force = g*! — and the fictitious force teal a*4™. The real force is 


24.3.2. The Maxwell-Lorentz Equations of Eleetrodynamies 


Maxwell’s equations are the fundamental equations of electrodynamics. When written 
in the four-dimensional notation with reference to the Minkowskian space, they are also 
called Maxwell-Lorentz equations. Here we consider the equations for the .free space. 
In this subsection we shall derive them from a variational principle, in Subsec. 24.3.3. we 
shall describe interpretations and transformations. Typical problems will be discussed in 
Sec. 25.3. Later on, within the general theory of relativity, we shall repeatedly discuss the 
Einstem-Maxwell equations; cf. Subsec. 30.1.3 and Sec. 33.5. 


Minkowskian space: As a starting point we consider the Minkowskian space 
referred to in the examples in Subsec. 24.2.2. So let Q=R,, and let the funda- 
mental tensor g,; have the form (24.2.2/3) in Cartesian co-ordinates, If (ls 
x, a4) are any curvilinear co-ordinates in the Minkowskian space, then the 
corresponding fundamental tensor is obtained from (24.2.2/3) by well-known 
conversion rules, cf. (24.1.2/11). From this formula it also follows that f= 
det gx7< 0, which has also been used in the examples stated in Subsec. 24.2.2. 
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Theorem. (a) Jf F* ds a contravariant tensor and if Aj ts a covariant vector in 
the Minkowskian space, then 


ee er 
US ¥—g PgirMsE” —~ > V-g Fel (Agni—Arz) (1) 


is a Lagrange density in the sense of Def. 24.2.2(b). 
(b) The Euler-Lagrange equations as stated in Theorem 24.2.3(b) for L with 
respect to the dynamic objects F* and A; read in Cartesian co-ordinates 


Fy = Ap ne Ary; with Fy = Gaon (2) 
and 
Fel,=0. (3) 


Remark 1. By Theorem 24.1.4/2(b), 4;7— Ay, is a covariant tensor. Of course the previous 
convention again has to be applied to (1), (2) and (3): one has to sum up over doubly 
occurring indices. Then ZL has the structure of (24.2.2/8), and hence is a Lagrange density 
for tensors in the Minkowskian space. 


Remark 2. As was already pointed out repeatedly, for the moment we are considering 

absolute space-time theories. Hence the fundamental tensor in Z is not a dynamic object, 
r 

and the tensors 7’ in (24.2.3/6) can be identified with F*! and A,. For arbitrary curvilinear 

co-ordinate systems in the Minkowskian space one obtains 


‘ er 7 (Saminz ee: i. 
OL ol ——— i= 
) = ope — ae ee —9 Feri? —> V—9 (Agi —Ape) - (5) 
It is now easily seen that both Fy; in (2) and #*! are anti-symmetrical, that is, F,;= —F yy 
and F4'— — F*, Tf this is taken into account, then for Cartesian co-ordinates the formulas 


(2) and (3) follow from (5) and (4), respectively. 


Remark 3. The formulas (2) and (3) are the Maxwell-Lorentz equations for the electro- 
magnetic field in the free space. In the next subsection we shall rewrite these formulas into 
the common, non-relativistic form, which looks much more intricate. As compared with 
this form, (2), (3) is a very clegant version. But it would be an error to believe that this 
elegance cannot be surpassed. We refer to Sec. 33.5. See also [64], p. 30, for the various 
formulations of the Maxwell equations. 


Was it a god who wrote these signs? 
(Boltzmann on Maxwell’s equations) 


24.3.3. Interpretation and Transformation of MaxwelP?s Equations 


Interpretation: As regards the modeling described in Subsec. 12.1.2. and its 
concrete formulation in Subsec. 24.2.1., we have the situation illustrated in 
Fig. 24.8. The “translation” is accomplished by means of the Lagrange density 
L int (24.3.2/1). But for the moment it is still unclear how #*! and A, are con- 
nected with physical quantities (we already have done the translation, but 
just don’t know the outcome). In the Minkowskian space we only just consider 
Cartesian co-ordinates, which we denote by (2, y, z, f)= (x1, x2, 73, r4). Thus it 
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Minkowskian space 


lL electro -~ 
~—— dynamics 
in tree space 


, interpretation 


Fig. 24.8 


is clear how these co-ordinates are interpreted: x=(x, y,z) are the space co- 
ordinates in the real three-dimensional space, and ¢ is the time. If 

(2, t)=(H,, Ey, Ez) is the electric field intensity and 

%8(z,t) = (Bz, By, Bz) is the magnetic field intensity, 
then the tensor Fy; is identified with 


(Om esse sh, Se0r 
ee es ey 
(Pai) = = Bo OS fae (1) 
fe on. Cant 


Further let 
(Ax) = i aby, A,, ) = (QL, ®) ’ (2) 


where %(=(A,, A,, Az) is the vector potential and @ is the scalar potential of 
electrodynamics. All functions depend on #=(z, y, z) and t. 


Theorem 1. 7f Fy, and Ay have the meaning stated in (1) and (2), then the Maxwell 
equations (24.3.2/2) and (24.3.2/3) read as follows: 


B=rot WA, C=grad ——- (3) 


AS 
rot y=, div ©=0. (4) 


Remark 1. Here (3) is the rewritten form of (24.3.2/2) and (4) is the rewritten form of 
(24.3.2/3). Altogether we have 10 equations, of which 6 are contained in (3), and 4 in (4). 
We recall the meaning of div, rot, grad: 


OH, OH, OE, dP OD IA@ 

5 ban 9s grad O= (= By =) 
OA, OAy OA;z “OA, OA, OA, 
t Y= = ees 
ls (= - Oz? Oz Ox’ Ou | 


The components of rot {& are the subdeterminants (the signs being taken into account) of 
the symbolic matrix 


sk aa 
ok cd 


a0 9d 90 


dx dy dz 
ae A, A, 


Y A, OA Ale 
Finally, - d -(* ee 


ae 
at aye o aye ull by analogy with re 
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Remark 2. As has been stated, (3) and (4) are the rewritten versions of (24.3.2/2) and 
(24.3.2/3). Here we have 


OMB. 8, E, 
uy] Be © -B, #, 


=IRy OP ea oe 
See emir 


Theorem 2. From (3) and (4) it follows that 


eB 
rot E+——=0, div B=0, (5) 
c(s 
rot a5 0, div @=0. (6) 


Remark 3. This is the common form of Maxwell's equations in the free space in the absence 
of charges and currents. (5) and (6) can easily be deduced from (3) and (4) (only (5) is new). 
Conversely, it follows from (5) that € and $ can be represented by (3), provided that % 
and @ are chosen suitably. (6) is not modified in this form. In the presence of electric 
charges with the charge density g, and of electric currents with the current density 
J=(jzs Jy: Jz). (5) remains unchanged, whereas (6) is replaced by 


a 
rot B eae div €=4z0 . (7) 
One then obtains Maxwell's equations in the free space in the presence of charges and cur- 
rents. Typical initial value problems for these Maxwell equations will be discussed in Sec. 
25.3. Corresponding problems for the Einstein-Maxwell equations will be studied in Sec. 33.5. 


Remark 4. Before the Maxwell equations were established, electric and magnetic fields were 
largely considered separately. The fact that these two fields are coupled so closely, as it 
is expressed in the equations (5) and (7), may well have been one of the reasons for Boltz- 
mann’s comment cited above. The relativistic formulation and the interpretations to be 
given in Chap. 25 will show even more: what is a purely electric field in one inertial system, 
may appear as a mixed electric and magnetic field in a different inertial system. 
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Absolute, true, and mathematical time, of itself, 
and from its own nature, flows equably without 
relation to anything external. 

Absolute space, in its own nature, without relation 
to anything external, remains always similar 
and immovable. (Newton, Principia, 1687, [46])!) 


From this hour on, space as such 
and time as such shall recede to 
the shadows and only a kind of 
union of the two retain significance. 
(Minkowski, 1909) 


25. Principles of Special Relativity and Electrodynamics 


25.1. Lorentz Group and Space-Time» 
25.1.1. Minkowskian Space and Inertial Systems 


The Minkowskian space was introduced in Subsec. 24.2.2. We have Q= Ry. 
Further, 


A 0 j 
(a1) = (Yx1) \, 1 33 i (1) 
is the fundamental tensor in Cartesian co-ordinates, which from now on we 
shall denote by (x1, x2, 23, x4) or by (x, y, z, é), as required in the particular case. 
The latter notation refers to the physical interpretation given in Subsec. 24.3.3: 
(x, y, z) is the position in the real three-dimensional space, and ¢ is the time. 
It is painful to dispense with the free choice of arbitrary curvilinear co-ordinate 
systems in the Minkowskian space, but for the moment it is mathematically 
and physically necessary. It is our aim to describe the new space-time concep- 
tion of the theory of relativity. Within special relativity one has to accept con- 
straints that restrict the range of validity of this theory. This is because an 
unscrupulous handling of this calculus leads to awkward inconsistencies from 
both the mathematical and the physical point of view, as the twin paradox to 
be described in Subsec. 25.2.1. will show. These constraints will no longer 
exist in general relativity. Then the new space-time conception and the possibil- 
ity of choosing arbitrary curvilinear co-ordinate systems are reconciled with 
each other. The price to be paid for this is an extension of the calculus presented 
in Sec. 24.1., especially of its foundations. 


Inertial frames: The special theory of relativity refers all its statements to 
inertial frames. The three-dimensional Euclidean space with the Cartesian 


') Einstein (1933): “Newton felt by no means comfortable about the concept of absolute 
space, ... of absolute rest... . [and] about the introduction of action at a distance.” 

*) “T regret that it has been necessary for me in this lecture to administer such a large 
dose of four-dimensional geometry. I do not apologize, because I am really not responsible 
for the fact that nature in its most fundamental aspect is four-dimensional. Things are 
what they are...” (A. N. Whitehead, The Concept of Nature, 1920). 
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co-ordinates (x, y, z) (or a subdomain of it) and an associated time measurement 
¢ are called an inertial frame if, as referred to it, 


every force-free mass point moves uniformly in a straight line. (2) 


In our terns this means that the Minkowskian space with the Cartesian co-ordi- 
nates (x, y, z, f) and with the above interpretation of position and time is admitted 
to competition only if (2) is fulfilled. Mathematically this is satisfactory, if 
only because, in the mathematical part of the considerations to follow, we 
shall never make use of this definition (but we shall do so in the physical inter- 
pretations). From the physical point of view, there arises the question of wheth- 
er there are inertial frames in nature, and how to find them. If one has a 
space in which there are no acting forces (i.e., forces due to physical causes), 
then one can take the trajectories of three mass points moving freely (and not in 
one plane) to construct a Cartesian co-ordinate system. Then the time scale 
must be so chosen that the mass points move at a constant velocity. The result 
is an inertial frame. One can also proceed in a (physically) simpler way: experi- 
ence shows that a Cartesian co-ordinate system which has its origin fixed at the 
centre of the sun and is oriented in a rigid relation to the starry sky is an inertial 
frame, provided that time is measured by a common physical procedure (en 
by means of an atomic clock). 


Remark. The considerations described in Subsec. 24.3.1. now show that with such a con- 
ception it is not possible to calculate with arbitrary curvilinear co-ordinate systems. 


The ether hypothesis: Let (7, y, z,¢) be an inertial frame with the space co- 
ordinates (x,y,z) and the time ¢. Consider another frame (x’, y’, 2’, t’) that 
moves uniformly in a straight line with respect to (x, y, z) in the three-dimen- 
sional space (Fig. 25.1). Time measurement is supposed to be identical in the 
two frames, that is, ’ =¢. With a suitable normalization one then obtains the 
Galilean transformation 


ra=x'+out, yoy +t, c=2’ +f, t=l’. (3) 


Fig. 25.1 


This is a simple geometric conversion. It is immediately seen that (2) also holds 
for (2’, y’, 2’, ’), so that one obtains a new inertial frame. The two frames are 
fully equivalent, and the laws of classical mechanics are invariant under Gali- 
lean transformations. If one applies (3) to equations of electrodynamics, e.g., 
to (24.3.3/5)—(24.3.3/7), then it is seen that this invariance is lost. The formulas 
change their form. By the end of the past century and at the beginning of this 
century, this led to the so-called ether hypothesis. According to this hypothesis 
there exists a distinguished inertial frame in which both Maxwell’s equations 
and the fundamental equations of mechanics hold in their common form. If 
one considers a frame of reference that moves uniformly in a straight line with 
respect to the ether, then one has to convert the formulas of electrodynamics 
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according to (3). The ether was a hypothetical medium in which electromagnetic 
waves were supposed to propagate. In particular, the velocity of light was 
assumed to be constant and independent of direction in this ether!). For the 
purpose of this chapter, let us normalize the value of the velocity of light to 1. 
The constancy of light velocity also opens the possibility of verifying the 
existence of ether by experiment: if a reference frame moves relative to the 
ether, e.g., according to (3), then the velocity of light, as referred to this frame, 
must generally be different from 1. The effect is small, but at that time it was 
already within the reach of the art of physical experimenting. But the ingenious 
experiments carried out on this subject altogether yielded negative results: 
within the limits of measuring accuracy, a deviation of the velocity of light 
from 1 was not observed. 


Constancy of the velocity of light: In 1905, Einstein postulated that: 


The velocity of light in empty space ts equal with respect to all 
inertial frames 


‘(=1 in our calculus). Of course this contradicts the classical Newtonian con- 
ception of space and time, and especially is inconsistent with (3). Further it was 
required that the form of Maxwell’s equations should remain unchanged under 
a transformation from one inertial frame to another. It was postulated that 
all inertial frames are perfectly equivalent and that it is impossible to distin- 
guish one of them by mechanical or electrodynamic experiments of any kind. 
It is clear that these requirements could only be fulfilled by a new conception 
of space and time. 


25.1.2. World Lines 


Before describing the mathematics that underlies special relativity, let us 
develop the ideas presented at the end of Subsec. 25.1.1. somewhat further. 
Consider the Minkowskian space with the Cartesian co-ordinates (x, y, z, é), 
where x=(2, y,z)€ Rs, is the position in the real three-dimensional space and 
tis the time (Fig. 25.2). We shall always assume that (., y, z, f) is an inertial 


| X(T) 
| 


~) 


{ (Pie, Bap 

frame. The world line of an observer who is at rest at the position # is a straight 
line which is parallel to the ¢-axis and passes through #: the point (2, ¢) indicates 
that the observer is at the position % at the time ¢. A moving observer has the 
world line (a(t)=x'(r), x2(z), a(t), 24(z)) (solid line in the figure). The trace 

') In 1760, Euler said: “Accordingly, light is nothing else but a motion or vibration in the 
smallest particles of the ether, and this is present everywhere, because of its extraordinary 
fineness, by virtue of which it penetrates all bodies.” 
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(broken line) in R, indicates the position, and x(t) indicates the time. Let us 
choose t=¢ as the curve parameter t. For physically meaningful curves this 
does not mean any restriction (the time avis is oriented, and the time scale is 
passed only from —o to ). If @=(w), uo, uy) is the common, three-dimensional 
velocity, then one obtains 


dt) 1 i ea eae 
ae =(Uy, we, Uy, 1), lel=V~uit+agtud. (1) 
Physieal principles: In accordance with the constancy of light velocity, the 
following is postulated: 

If physical information ts transported 

along the world line x(t), then |u| =1. 


In particular this holds for electromagnetic waves (where |a@| = 1) and for the 
world lines of material particles. If always |@| <1, then the curve is called time- 


like. See also Subsec. 24.2.2. As was already mentioned there, for time-like 
curves one has 
de dr 4 
Nir et +3 —-1<0 
de ae : : 


and the tangent (1) to x(t) always points into the interior of the right circular 
cone with x(z) as a singular point as described there (Fig. 25.3). 


X(T) 


we <> 


R; Mig. 25.3 


25.1.3. The Lorentz Group 


Consider the Minkowskian space with the Cartesian co-ordinates x* and with 
(25.1.1/1) as the fundamental tensor. The interpretation as a space-time and 
the requirement that the co-ordinate system considered shall be an inertial 
frame are of no interest for the moment. Let x*=a*jr"', with det (a*,) +0, 
where, as previously, we have to take the sum over doubly occurring indices 
(from 1 to 4). With respect to the new co-ordinates x”, according to (24.1.2/11) 


ee Ox" Oita 7 Fa “sin Ou) 
Chi a i in? le Ts 
J ox'® Ox? i 


is the associated fundamental tensor. 
. . . ners 
Definition 1: 2* =a',x"' is called a (general) Lorentz transformation 7f gy= Yat « 


Remark 1. This means, what is desired are affine transformations in the Minkowskian space 
which do not change the fundamental tensor (7,7). Thus it is required that 


2 
MeL =A" EA rs « (2) 
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Theorem 1. The set of all Lorentz transformations forms a group (the Lorentz 
group) where the group multiplication is defined as the successive application of 
the transformations. 
Remark 2. The proof is rather simple. The identical transformation is ‘the unit element. 
Two properties will be of interest later on. If the determinant is formed on both sides of (2), 
one obtains 

—1=(det a*))2 (—1), hence deta*;=+1. (3) 
Further one obtains from (2) 

1144 =(a'e)?+ (a%)2+(42,)?—(a%s)?, hence a’) =A. (4) 
Definition 2. 2*=a*,x"' 71s called a proper Lorentz transformation if gi =Nea 
deta = laid a4 = 1. 


Remark 3. By (3) and (4) the additional requirements (as compared with Def. 1) are mean- 
ngful. 


Theorem 2. The set of all proper Lorentz transformations forms a group (the 
proper Lorentz group). 


Remark 4. So the proper Lorentz group is a subgroup of the Lorentz group. 


25.1.4. Special Transformations of the Proper Lorentz Group 


The transformation V(v): We look for proper Lorentz transformations of the 
form 


130 0 0 

: Cael 0 0 

OAC DADS Si Sanne (1) 
0 0 -vy y 


, where 


From y=a'‘,2=1 and det V(v)=y2—»%2=1 it follows that y= 
1—v? 


lv|-<1. If we again set (x)= (a, y, z, 1), and accordingly for (x), then we obtain 
z—ut i—VE 
ar ae a ee (2) 
V1—v2 ¥1—=# 
(The fact that we interchanged the primed and the unprimed co-ordinates as 
compared with Def. 25.1.3/1 is of no account.) Now it is readily verified that 
(2) is in fact a proper Lorentz transformation. For we have 


(P+ (Y P+ P= C22 + Pt E+E —L) 


= 72 yet (1—v)(2+O(14 v)(2-H=224+y2422-2 


1—v? 
and hence g;;= 7). The transformation inverse to V’(v) is V(—»). 
Rotations in Rs. Let 
Chl ei Coie d: 0\33 
=|) a2 3] and D=( 4 1 (3) 


0: 1)% 
%31 &39 M33 ee: 


ea = - ; , 
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aoe (ieee) d 
where aarti = b= {9 eg ee (summation over r from 1 to 3), and det d=1. 
Then d is a rotation matrix in fs. Hence in particular one has (x’)2+ (y’)2+ 
-\9 9 ‘ * 4 
+ (2)? =a7+ y? +27. Since det d=1, D is then a proper Lorentz transformation. 


The transformation D-!l(v)D. Together with D, D-! is also a proper Lorentz 
transformation, and hence also D-!J’(v)D. These trausformations are of physical 
interest. We need some structural informations. If (w,, wo, w3)€ Ry is the unit 
vector transformed into (0, 0, 1) by the rotation d as given in (3), then it is 
easily seen that 


. ; x ty! 0 

_ Fe, Wo 0) 
Den 1 2% wy | 0 ane JE ws) O ©) 

‘OC O10 1 Oa 


where * indicates that values of the places in question are of no interest. From 
(1) we then obtain 


0 
ae 7 i 0 
DEE OD )-? 
ee ee ee 2) 
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25.1.5. Space-Time (Physical Aspects) 


Consider any two Cartesian co-ordinate systems (x, y, z) and (2’, y’, 2’) in the 
real three-dimensional space /, (Fig. 25.4). Suppose that the observers 0 and 0’ 
are located at the origins of these systems, and that they do not leave these 
positions in the course of time. 0 and 0” agree that they will measure times and 
lengths by identical physical processes, say, by means of atomic clocks and 
standardized rigid bodies, or by means of the wavelengths of fixed oscillations. 


% YS Fig. 25.4 


The question of length and time measurement is not so unproblematic as it 
may appear at a first glance, because for the actual realization one inakes use 
of physical facts which, in their turn, must be described in space and time. Fora 
detailed discussion of this problem the reader is referred to [3] as well as to 
[26]. Here we suppose that the two observers 9 and 0’ measure lengths and time 
intervals each by the same method in laboratories which are at rest relative to 
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ae 


them. Thus 0 and 0’ are provided with quantitative space-times (x, y, z, ¢) and 

(x, y’, 2’, U), respectively, so that it makes sense to speak of velocities. We sup- 

pose that 0 makes the following statement concerning 0’: 0’ moves uniformly 

in a straight line, and the axes of the co-ordinate system (a, y, z) can at any 
time be transformed into the axes of the co-ordinate system (2’, y’, 2’) by apply- 
ing one and the same time-independent rotation matrix AR. Further we suppose 
that at the time ¢=¢/=0 both 0 and @’ are at the same position (that means 
r=Y=2=% =y'=2/=0). Then 0’ moves at thé velocity v in a straight line 
which passes through x= y=z=0 and the direction of which is given by the 
unit vector w= (w,, v2, w3). Thus at the time ¢ the position of 0” is vtw = t(v4, vo, v3). 
Now the observers @ and 0” are able to compare their quantitative, physically 
determined space-times (x, y, z, ¢,) and (2’, y’, 2’, ’) by experiment. The Galilean 
transformation (25.1.1/3) as a conversion formula does not fit the failure of the 
experiments devised to verify the ether hypothesis (cf. Subsec. 25.1.1.) (here 
we assume that F is the identity matrix). Rather there arises the problem to 
transform these two space-times into one another in such a way that they are 

compatible with the physical postulates stated in Subsecs. 25.1.1. and 25.1.2. 

and stand experimental tests. The experimental aspect is not our concern. 

For a description of experiments that support special relativity, the reader is 

referred to [26]. In the next subsection we shall state by axiom how (z, y, z, t) 

is transformed into (2’, 7, 2’, t’). But before doing so, let us once again recall 

those properties which are desirable for such a transformation. 

TY. The systems (x, y, z,¢) and (2’, y’, 2’, ’) are equivalent. If one fixes a 
procedure that transforms (zx, y, z,¢) into (2’, y’, 2’, t’), then the same 
procedure must also transform (zx’, y’, 2’, “) into (x, y, z, t) in case the 
primed and the unprimed co-ordinates change parts. 

Il. If (2, y, z, t) is an inertial frame, then (2’, y’, 2’, t’) is also an inertial frame 
(and vice versa). 

III. The Maxwell equations in their classical form (24.3.3/5)—(24.3.3./7) remain 
unchanged under a transformation from (a, y, 2, ¢) to (2’, ’, 2, i) (and 
vice versa). 

IV. If x(x) is the world line of a physical event in (x, y, z, ), then let x(t) be 
the transform of this world line to (2’, y’, 2’, ¢’)-co-ordinates. Let the three- 
dimensional velocities in terms of (25.1.2/1) be denoted by |#| and |z’!. 
Then we require that |%|<1 implies |w’|<1 and that |%/=1 implies |7’| =1 
(and vice versa). 


Remark 1. The statements I, IT, and IIT correspond to the requirements made at the end 
of Subsec. 25.1.1. The statement IV is a strengthened version of the physical principle 
stated in Subsec. 25.1.2., which includes the constancy of light velocity as a requirement. 


Remark 2. The corrections applied to Newton’s mechanics by special relativity are meas- 
urable only if the relative velocity v of the two co-ordinate systems (x, Wn th) auanel (2, 7, 
2, t’) is not too small compared with the velocity of light. Thus it appears resonable to 
require that, as |v| 0, the transformation from (a, y, z.t) to (a’, y’, 2’, ‘’) approaches the 
Galilean transformation (25.1.1/3) (where # is the identity matrix). 
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25.1.6. Space-Time (Mathematical Aspects) 


The rotation matrix FR, the unit vector W=(Wy, Wy, wy) and the velocity v 
shall have the same meaning as in Subsec. 25.1.5. In accordance with the 
physical principles stated in 25.1.2. we assume that |v! <1. (v<0 is admitted, 
so that w can be replaced by —1). If one sets 


z R| 0\} 3 
h=(= ‘ee () 


then R is a proper Lorentz transformation. 
Axiom. The space-times (x, y, 2, t) and (x’, y’, 2’, U’) described in Subsec. 25.1.5. 
are transformed ‘nto one another by 

Ga ate HED WR) Diary, =o) (2) 
Here V(v), Dand D~' have the meaning given in (25.1.4/1) and (2554/4), 


Remark 1. According to Theorem 25.1.3/2 and to the considerations described in Subsec. 
25.1.4., RD~!V(v)D is a proper Lorentz transformation. 


Remark 2. P is unique. whereas D in (25.1.4/4) is not. In order that (2) be meaningful, one 
has to show that (2) is independent of the particular choice of admissible matrices D. If 
D, and Dy, are any two admissible matrices, then D,D*! transforms the vector (0, 0, 1, 0) 
into itself and the vector (0, 0, 0, 1) into itself, too, and hence has the form 


. [00 
a 00 
ees ero 1 0) 2 
0 0 OL 


Dz and Dz! commute with V(v). Then it follows from D)=D3D, and Dy!=D;1D;! that — 
Dy1V(e)D2.=Dj1Dz1V(v)D3D,=DzV(o)D, . 
which proves the desired independence. 


Remark 3. Since 7’ = RD-1V(v)D is a proper Lorentz transformation, the cones referred to 
in Subsec. 25.1.2. are mapped onto cones of the saine type (Fig. 25.5). Here the upper and 
the lower half-cones (as-referred to the orientation of the ¢-axis) are again transformed into 
upper and lower half-cones, respectively. But now the considerations described in Subsec. 
25.1.2. immediately show that the requirement IV of Subsec. 25.1.5. is fulfilled. In particu- 
lar (in the formulation used there) one has |@| =1 if and only if |@’|=1: this is the constancy 
of the velocity of light (normalized to 1). 


Remark 4. A motion is rectilinear and uniform if and only if the corresponding world line 
in the Minkowskian space is a straight line. Since 7’ is linear, straight lines are transformed 
into straight lines. This shows that requirement II of Subsec. 25.1.5. is also satisfied. 


ie 
yee ee | 
Z : a 
(nz) Fig. 25.5 
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Remark 5. In particular this implies that the observer 0’ will find that, as referred to his 
own space-time (x’, y’, 2’, t’), 0 moves uniformly in a straight line. R and D do not influence 
the velocity of this motion, so that we may assume that 7'=V(v) is given by (25.1.4/2). v is 
obtained from 0=2z’=2z—vt. Since V(—v) is the transformation inverse to V(v), the same 
arguinent shows that the velocity of 0, as measured in the space-time (a’, y’, 2’, t’), is —v. 
If we now want to apply the axiom by changing the parts of (x, y, 2.) and (a’, y’, 2’, t’). 
then we have to make the following substitutions: 


(Gs G5 Bale Ge R, i) 

(x’, y’, 2’,t’): —v. RA}, DR-!. 
Hence requirement T of Subsee. 25.1.5. is fulfilled if 

(a. y, 2) = R-RD-1V(—v) DR-(2’, y’, 2’, !)=D-1V(—») DR- a’, y’, 2, #') . 
But according to (2) this is true. 


Remark 6. Finally, requirement IIT of Subscc. 25.1.5. is fulfilled, as is shown by the deri- 
vation of (24.3.3/5) and (24.3.3/6) from the invariant equations (24.3.2/2) and (24.3.2/3). 
Thus we have reached our aim: (2) is supported by experiments within the limits of meas- 
uring accuracy, and the requirements I-IV of Subsec. 25.1.5. are fulfilled. Finally we 
recall the (necessary) self-restriction pointed out in Subsec. 25.1.1.: the space-time (2, Yee 
t) (and hence, by requirement II if Subsec. 25.1.5., also (2’, y’, 2’. ‘’)) is an inertial frame. 
According to the above considerations, the fundamenta] equations of electrodynamics are 
invariant under the proper Lorentz transformations. On the other hand the fundamental 
equations of mechanics are not invariant under proper Loentz transformations (whereas 
they are invariant under Galilean transformations). In order to achieve invariance, one 
thus has to correct the fundamental equations of nechanics (and not those of electrodynam- 
ics, as was intended originally, when the Galilean transformation had been taken as a 
basis). 

Remark 7. In the c.g.s. system of units, where ¢c ~3 + 105 km/s is the value of the velocity 
of light, the last two formulas in (25.1.4/2) read 

ve 


z—uwt Be 
a — 


/ a ot ea 
Te te 


If |z| is much smaller than c, then these formulas (if interpreted liberally) approach 
z =x—vt and t’=t. See also Remark 25.1.5/2. 


Remark 8. If fis the identity matrix, then from (2) and (25.1.4/5) one obtains the important 
formula 
t =(2. x) 
Vi-v2 
Here (0, ) =0,e + v9y +042 is the common scalar product in Rs. 


» Where 8=(v4, vy, V5) =v(w,, Wo, 3) and #=(x, y, )) (3) 


Re 


25.2.  Hffects of Special Relativity 
25.2.1. Time Dilatation and the Twin Paradox 


It is not our aim to give a systematic description of the theory of special 
relativity. We shall confine ourselves to the description of several spectacular 
effects that illustrate the radical change of the space-time conception estab- 
lished by Einstein as compared with Newton’s space-time conception. 
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Time dilatation: Consider the same situation as described in Subsec. 25.1.5.: 
two observers 0 and 0’ with the space-times (., y, 2, (anda: 77 oe. 1), which 
represent inertial frames (Fig. 25.6). Let R= HF be the identity matrix (a simpli- 
fication which is physically of no account). The observer 0 trips a flash of light 
each at the times ¢=0 and ¢= 7'=0, thus marking a time interval of length 7 
in the space-time (x, y, z, t). The world line of 0 is a straight line which passes 
through x= y=z=0 and is parallel to the taxis (Fig. 25.7). According to (25.1.6/ 


3), the observer 0’, in his space-time (x’, ’, 2’, t’), finds that the two light flashes 
4 Al 


occur at the times =0 and =7"= =: In particular this gives 7”’> 7’. 
Vl—v2 

In other words, the length of the time interval between two events depends on 

the choice of the space-time (2’, y’, 2’, t’). The time interval is shortest if one 

chooses a co-ordinate system which is at rest relative to the event. 


aes 
Fig. 25.6 Ay Fig. 25.7 


u-Mesons, An impressive (and experimentally well-supported) example of time 
dilatation is shown by the y-mesons. It is known that they are generated at an 
altitude of 10-30 km above the earth’s surface and that (in a rest frame) they 
have a lifetime of 2.2 10-6s. Although they travel almost at light velocity, 
according to the classical argumentation they can travel at most a distance of 
2.2 1078s 3105 km/s<1ikm. But on the other hand they arrive at the 
earth’s surface. The reason for the above false conclusion is that in a co-ordinate 
system which is at rest relative to the earth’s surface (but just not relative to the 
f-meson) a time dilatation occurs, which in the present case amounts to a factor 
of 10? to 103. 


Twin paradox. There is hardly another effect of the theory of relativity that 
has been discussed in so great detail and so vehemently as the so-called twin 
paradox: there is not only a huge crowd of physical and philosophical papers on 
this subject, but whole books have been written about it. The problem considered 
is as follows. Suppose there are two twins. One of them stays on the earth, while 
the other undertakes an extensive excursion to the space. Apart from the 
starting, turnabout and landing phases (for which the time required will be short 
compared with the total flight), twin 2 is supposed to move uniformly in a 
straight line at the velocity v, as referred to a co-ordinate system at the origin 
of which twin 1 is at rest. After the space flight has been finished, the twins meet 
again, and each of the two claims to be younger than his twin brother. This 
collides with the simplest rules of logics, but can be reasoned as follows: let 
(x, y, 2, t) be a space-time in which twin 1 is at rest (Fig. 25.8). Hence his world 
line is a straight line through 0 that is parallel to the t-axis. In this co-ordinate 
system the world line of twin 2 is curved. Neglecting the start, turnabout and 
landing phases, we idealize this curved path into a triangular one, and now we 
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| 
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consider one half of the adventure. If (2’, y’, 2’, () isa correspouding space-time 
for twin 2, then we have the situation investigated above, which gives a time 
dilatation of sie = = which means that 7,> 75. Here 1’, is the proper 

a ay 
time of twin 1 that elapses between departure and return. An analogous state- 
ment holds for 7,. The proper time is the time measured by an observer in a 
laboratory which is at rest relative to himself. It gives the timing for inorganic 
and organic happenings (that is at rest relative to himself), and hence also gov- 
erns the biological clock by which he ages. Now one may interchange the 
parts of the twins, so that one obtains 7',~7',. Thus one has the paradoxical 
result that, when the twins meet again, each of the two will be younger than the 
other. So the above argumentation must be wrong (in case we are confident that 
the mathematical model developed in Sec. 25.1. is in fact suitable to give a 
satisfactory, and hence in particular a logically consistent, description of physi- 
cal reality). 

As a first possible objection, one may ask for the legitimacy of the above 
idealization (i.e. of neglecting the start, landing, and turnabout phases). If one 
thinks of the space flights as being extended at choice, the start, turnabout, 
and landing operations always being executed in the same way, then the above 
idealization appears at least physically plausible. 

The error is hidden elsewhere: at most one of the two space-times (x, y, 2, é) 
and (2’, y’, 2’, t’), which are rest frames for one of the twins each, can be an iner- 
tial frame. If, for instance, (x, y, 2, t) is an inertial frame (which can be supposed 
approximately: if necessary, one settles twin 1 in the centre of the sun, making 
use of the remarks on inertial systems as given in Subsec. 25.1. 1.), then the above 
considerations which led to 7',> 7’, are admissible within the special theory of 
relativity: the considerations refer to the inertial system (x, y, 2, t). On the other 
hand, (2’, y’, 2’, t’) is not a space-time system to which the special theory of 
relativity can be applied. Hence the parts of the twins cannot be changed in the 
above argumentation. At this point it turns out that the restriction to inertial 
systems (x, y, 2, t) is necessary in the special theory of relativity. So the contra- 
diction has been eliminated; what remains is a slight uneasiness. 

Now, how about the real solution? In fact the twin paradox is an effect of the 
general rather than the special theory of relativity. If (as supposed above) 
(x, y, 2, ¢t) is an inertial frame, then in general relativity one cannot fail to 
conclude (without any tricks of argumentation, neglections etc.) that always 
1’; > T') (no matter whether twin 2 departs for a longer or shorter trip, the only 
thing he must do is to travel). Popularly, it has been known all along: “Resting 
is rusting”’. 
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25.2.2. Lorentz Coutraction 


In the inertial frame (x, y, z, 4) we consider a moving rod (Fig. 25.9). To deter- 
mine the length of this rod, we consider the trace points of the world lines of the 
two rod ends in the plane ¢= 7’ = const. The three-dimensional Euclidean distance 
of these two trace points is the length of the rod (at the time ¢= T). We now 
consider two inertial frames (x, y, z, ¢) and (yey 2, tf), where Gr. 7/0 2 jai 
supposed to move uniformly at the velocity v along the z-axis (Fig. 25.10). 
Then, by the axiom stated in Subsec. 25.1.6., the two systems transform accord- 
ing to (25.1.4/2). If (0, 0, 0) and (0, 0, J) are the front and the rear end of a 
rod at rest in the (2, y, 2) systein, then in the (x’, y’, 2’, ¢’) system one obtains 


pe, a 


Z4=——, = for the front end, with t=4, , (1) 
yil—2 ¥i—-wv 

, ¢t—v , -t.—0l 

Ze =, i&=——— for the rear end, with f=¢,. (2) 


oe (ee 5 Y1—wv 


Fig. 25.10 


Fig. 25.9 


The length of the rod in the (x, y, z, ¢) system is /. To determine the length of 
the rod in the (2’, y’, 2’, ) system, we have to set ¢;=¢, and hence ¢, =¢,—vl. 
Insertion into (1) and (2) gives 

(1 — v2) 


, l ie 
=.= | Yl—v?. 


il =e 


Hence /’</, Ina moving inertial frame the rod is shorter than in the rest frame. 


v 
Or 
is 


3. The Relativistic Addition Theorem of Velocities 


Consider two inertial trames (7,7, 2,7) and (7, 7, 2, ¢), where (2, y’, 2) 
moves uniformly in a straight line along the z-axis (Fig. 25.11). By (25.1.4/2) 


one has = iy — 4, 


eal t—vz (1) 


where v is the relative veloeity. Further consider a particle whieh moves uni- 
formly in a straight line with respect to the (x, y, z, t) system (Fig. 25.12). One 
has 

=U, ySugl, z2=Ust, (2) 
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where (uj, us, Uy) is the velocity vector. By inserting (2 ) into o one obtains the 
following relations for the velocity vector (wj, ug, 43) in the (2, 7’, 2’, t’) system: 


x V1—v? Le V1—v ; 


Z  Us—Vv 
Uy == 9 uU oe 
ie 1—vus’ t Te t 


= (3) 


If the particle moves at the velocity wu in the z-direction, ie., if wu; =u.=0 and 
u;=u, then one obtains the relativistic addition theorem of velocities, 
u—v 
uo =——.,  (u’=u5). 4 
(er 3) (4) 
The classical counterpart reads w’ = u—v; thus the factor (1 So is the rela- 
tivistic correction. From the formulas (3) and (4) we can once again read off the 
previous statement that uj+uj+u3<1 implies w?+us?+u42<1 and that 
tenes a = 1, 


25.2.4. The Free Relativistic Particle 


In an inertial frame (c*) = (a, y, z, t), consider a force-free particle that moves 
uniformly in a straight line. We ask for a Lagrange density for curves which 
describes this situation. According to (24.2.2/4) we make the setup 


les dle! 
bana me (1) 


Here « is a real constant eee is unimportant for the time being). Now it 


d2x 
follows from (24.2.3/5) that ve 0, provided that s is identified with the 
dt da! 
ds ds 
constants, one has a2*=u%s. From t=2x'= u's it follows that the motion is recti- 
linear and uniform, as should be expected. 


geodesic parameter, which means that ~—7,, ———=1. Hence, apart from 


25.2.5. Proper Time, Mass, and Energy 


da? da! 
Proper time: From the normalization —1,;— Tne = 1 and from ¢=w4s in Subsec. 
25.2.4., it follows for the force-free particle that 1— —|ul?(w4)2 + (u4)2, with 
k 


ce 
ele (ais )2 = r Sj a . gi 
|oe|2 = (144)? + (tla)? + (wg)? and Ee Thus, after choosing a suitable sign one 


obtains ¢= . Now it follows from (25.1.6/3) that the arc length s is the 


V1—[u2 
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proper time of the particle, that means, the time measured in an inertial frame 
which is at rest relative to the particle. In the general theory of relativity we 
shall then extend this statement to arbitrary motions (which are not necessarily 
rectilinear and uniform). 


Mass and energy: Let us carry out some other speculative calculations, but 
without entering into physical reasons (see for instance [11]). For small veloci- 
ties |u|,by substituting ¢ for s one obtains from (25.2.4/1) 


h= =e PS ee ee (1) 


neces : : my : 
The non-relativistic Lagrange density is Lassa lul?, where mg is the mass 


(rest mass) of the particle. Additive constants are of no importance in Lagrange 
densities (the Euler-Lagrange equations contain nothing but partial derivatives). 
Thus a comparison of (1) and LL. suggests choosing «’= mp), so that D= 


= —m,V¥1—|ul2. The momentum ; is obtained from 


OL MoUy 
Pr = ae aia > (2) 
OU, a= fa{2 


where m=——“"— is the inertial mass. If we calculate the energy by H= 


: VI—luP i 
= >) Pru, —L, then the result is H= —°._—m. Thus one obtains the rela- 
ee V1—lul? 
Sl 7 , E=m, which have to be verified by experiment. 
me 
If the velocity of light is not normalized to 1, then in the c.g.s. system of units 
one obtains Einstein’s famous mass-energy relation EH = mc?. 


Mo 
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25.3. The Maxwell Equations 
25.3.1. Formulation of the Problems 


Maxwell’s equations in free space and in the absence of electric charges and 

currents were described in Theorem 24.3.3/2. We use the notation employed 
there; in particular we write =(r, y, z)€ Rs. 
Problem 1 (initial value problems, classical solutions). Given the vector functions 
(,(z) and Bi (z) that are three times continuously differentiable on Ry. Desired 
are vector functions E(x, t) and B(x, t) which ure twice continuously differentiable 
on Ri, such that 


Ds) 


rot E+ =0, divB=0 in Rr, (1) 
a(t = 

er yo, te =n wes (2) 
Oo 


and 


E(z, 0) =C, (7), B(x, 0) = Bo (x) in ty. (3) 
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Remark 1. As in Subsec. 24.3.3., G(4, ) and ®(@, t) each have one x-component, one y- 
component, and one z-component. The same holds for &(@) = (Ho,2(@), Loy(#), Zo,.(#)), and 
accordingly for ¥(%). This is a typical initial value problem, being analogous to the initial 
value problems for the wave equation in Sec. 19.3. Given are the electrie and the magnetic 
field intensity at the time ¢=0. For physical reasons the above problem should then have a 
unique solution for ¢>+0. 


Remark 2. If charges and currents are present, then, as was described in Remark 24.3.3/3, 
the formulas (1), (2) must be replaced by 


a8 

rot & pO div B=0 4 (4) 
0 

rot % — op = 40); div E=4z0 . (5) 


Here g is the charge density and j=(j,, Jy Jz) is the current density. The formulation of 
Problem 1 might immediately be extended to this case. But it is physically reasonable to 
consider point-like charges, charges on surfaces, currents along lines etc. Then one cannot 
expect classical solutions of (4), (5) (by analogy to Problem 1). In a way analogous to 
Sec, 23.2., one will attempt to find distribution solutions. We use the same notations as in 
Subsec. 23.2.1. Thus in fhe OWE OL One Ho,7®6 means the tensor product of 
Eo,2€ DR), with respect to @, and 6€D(R,), with respect to ¢. For abbreviation we set 
G)@d=(£y 286, Eoy®6, Eo,,06) and analogously for $)@6. Finally, supp ECR} means 
that the supports of all components of © lie in 7. Analogous notations are used for other 
vector distributions. 

Problem 2 (initial value problems, distribution solutions). Given Co= (Lo, Foy, 
Ho.) and B= (Boe, Boy By) with components in JD), (Oy tha) ede 
components 'n D’(R,) and supp7c Ri, and e¢D'(R,) with supp ec R}. Desired 
are €=(E,, Ey, E,) and B=(B,, B,, B,), with components in D'(R,) and 


suppEc Ri, supp Bc Ri, such that 


oO aN 
rot 6+ Bw, 06, div 8=0, (6) 
a : 


Remark 3. This is the analogue to Def. 23.2.1(a), where distribution solutions of initial 
value problems for the wave equation were described. Asin Theorem 23.2.1, it can be expect- 
ed that distribution solutions are classical solutions, provided that all the data are sufficient- 
ly smooth. 

Theorem. Jf 7=0 and o=0, and tf the components of ©y and Bo are three times 
continuously differentiable on Rs, then Problem 1 and Problem 2 coincide. 


25.3.2. Initial Value Problems 


Theorem 1. Jf €o(%) and Bo(Z) are vector functions three times continuously 
differentiable on R3, with div Go(z)=div Bo(Z)=0, then Problem 1 has a unique 
solution. It is given by 


1 1 aft 
G@o-— f rot Bo(D) day +z =| i Gua) ay], 


‘ |g —z| =t fee eee 
1 
SRY ees aa : G7 il ioe ae a se WR (7 es 
Meng fm Gidr ee [> f wa) as,| 
ly—#|=t lg—z\=t 
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Remark 1. The explicit solutions G(#, #) and ¥(4, t) agree with the solution (19.3.3/2) for 
the wave equation in three dimensions (provided that wp and u, are chosen suitably). 
This also gives an indieation of how to treat the Maxwell equations. They are reduced to 
wave equations for &(%, £) and B(z. ¢). For curved space-times we shall describe this proce- 
dure in Sec. 33.5. 


Theorem 2. Problem 2 has a solution if and only af 


div j+ 2 =div 688 and div 8 j=0. (1) 
If this condition is fulfilled, then Problem 2 has a unique solution, which is given by 
E=G x (—arad o—2 + (rot Bi) 89-+6) 90), (2) 
B=G x (rot j— (rot Ey) @6 + By @s’) . (3) 


Remark 2. According to our previous convention, div, grad and rot always refer to 2, y, 2, 
but not to ¢. The above theorem is proved by reducing it to Theorem 23.2.2; in particular G 
is the fundamental solution of the three-dimensional wave equation as given in Theorem 
23.1.4. If all the distributions concerned are regular, then © and 8 can be represented 
explicitly in the sense of (23.2.2/2). The above theorem was proved by Leopold [38]. 


26. Self-Adjoint Operators in the Hilbert Space 


26.1. Unbounded Operators 
26.1.1. Closed Operators 


In Sec. 17.3. we developed the geometry of the Hilbert space. In particular 
we recall that a Hilbert space in the sense of our terminology is always complex 
and separable. Thus Hilbert spaces are special separable Banach spaces. Linear 
operators (also called mappings) were described in Def. 20.2.1. We shall now 
use the same notations as introduced there. Chaps. 20 and 21 contain a descrip- 
tion of the theory of bounded (or continuous) operators in Banach spaces and 
Hilbert spaces. But it turns out that many operators of mathematical and 
physical interest are not bounded. Here the so-called self-adjoint operators in 
the Hilbert space are in the centre of interest. In the present chapter we shall 
develop the theory of these operators. In this chapter we consider linear oper- 
ators A acting in a Hilbert space H, which means that the (linear) domain of 
definition D(A) as well as the range f(A) of A, in the sense set out in Subsecs. 
20.2.1. and 20.2.3., lie in 7. 


he 26. Operators in the Hilbert Space 26.1.8. 


Definition. Let H be a Hilbert space, and let A be a linear operator with D(A) cH 
and R(A)cdH. 
(a) Por x€ D(A) and y€ D(A) we define [x, yl= (x, y) + (Az, Ay). 

(b) A ws sazd to be closed if the Nmit element x of every convergent sequence 
(tp }e=1 C D(A) with Ax, + y belongs to D(A), and tf Ag, 
Remark 1. It is immediately verified that [v, y] is a scalar product on D(A). Hence it is 
possible to introduce a norm on D(A) by setting lll, =V fa, a]. The question is whether 
D(A) is complete with respect to this norm. 
Remark 2. Thus in part (b) of the definition one has to consider all convergent sequences 
{x} in D(A) whose images {A2,} are also convergent. This is a typical construction for 
not bounded operators. For, if A is bounded (in the sense of Def. 20.2.1) then Ly —>x mmMe- 
diately implies that Ax,, Ax. Hence, in particular, every bounded operator A with D(A) = 
=H and R(A) CH is closed. Thus it is especially not bounded operators that are of interest, 
as is also shown by the following theorem. 


Theorem. (a) /f A 7s a closed operator with D(A) =H, then A 7s bounded. 
(b) A ts closed tf and only tf D(A) is closed with respect to the norm |\x|| 4. 


Remark 3. The proof of part (a) is not so simple as it would be expected at a first glance. 
It is the converse of the statement made in Remark 2. 


Remark 4. Part (b) says D(A), with the norm l|zl|4, is a Banach space if A is closed. If this 
Banach space is separable, then we even have a Hilbert space, because ||a]| , has been derived 
from a scalar product. 


26.1.2. Closable Operators 


Definition. The near operator A is said to be closable if there exists a closed 
operator B, in the sense of Def. 26.1.1, which ts an extension of A. 
Remark 1. All operators in this chapter are linear and act in a Hilbert space Hf. An operator 


B is called extension of the operator A if D(B) > D(A) and Bx = Ax for x€D(A). For this we 
also write BDA or ACB. 


Theorem. Jf A is closable, then there exists a un ique minimal closed extension of A. 
It is denoted by A, and its domain of definition, D(A), is the completion of D(A) 
with respect to the norm ||-||4, as referred to in Remark 26.1.1/1. 

Remark 2. A is called closure of A. Hence the assertion is that BDA if Bis a closed ex- 
tension of A. The last part of the theorem says that D(A) consists of all elements x€H tor 
which there is a fundamental sequence (Cauchy sequence) {@x}e—1 in D(A) (with respect to 


the norm ||-|4) such that 2,2 in H as k-+. The statement that {x,};_, is a fundamental 
sequence in D(A) with respect to the norm 14 is equivalent with the statement that 


(ve}e-y and {Aap}, are fundamental sequences in H. Hence it follows that Ax, — Ae in addi- 
tion to 4,—>2. 


26.1.3. Adjoint Operators 


Definition. 7f A zs a Nnear operator whose domain of definition D(A) is dense in H, 
then we set 
D(A*)={y | yeH, Ay* cH, with (Ax, y)=(4, y*) for all Pal Ae 
A*y=y* for yeD(A*). 
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Remark 1. y* is uniquely determined. From (Aa, y) =(x, y*) =(x, y**) for all x€ D(A) it 
follows that (a, y* —y**)=0 for all x in the dense sct D(A). But then y*=y**. Hence 
A*y=y* is meaningful. It is immediately scen that D(A*) is a linear set in Hf and that A* is 
a linear operator in H. A* is called adjoint operator of A. This is obviously a gencralization 
Gl Ie Vil il se 


Theorem. Let A be a linear operator whose domain of definition D(A) isdense in H. 
(a) The adjoint operator A* ts closed. 
(Dy atf BoA den A* > B*, 
(ce) Zf A vs closable, then (A)* = A*. 


Remark 2. Proposition (b) is meaningful, because if D(A) is dense in H then D(B) is also 
dense in’ H. 


26.1.4. Symmetric and Self-Adjoint Operators 


Definition 1. Let A be a near operator whose domain of definition D(A) 7s dense 
in the Hilbert space H. 

(a) A ts called symmetric if (Ax, y)=(x, Ay) for all r€ D(A) and y€ D(A). 

(b) A ts called self-adjoint if A=A*. 


Remark 1. If 4 is self-adjoint, then (Az, y) =(a, A*y) =(a, Ay) for all x€.D(A) and ye D(A). 
Hence self-adjoint operators are symmetric. 


Theorem. (a) /f A 7s symmetric, then A is closuble, and A 7s also symmetric. 

(b) If B is a symmetric extension of A, then Bc A*. 

(c) A linear operator A with a dense domain of definition D(A) 7s symmetric 
if and only if (Ax, x) ts real for all x€ D(A). 
Remark 2. Part (b) says that every symmetric extension B of the operator A (which is 


necessarily also symmetric) is a restriction of A*. In particular, every self-adjoint operator 
has no proper symmetric extensions: 


Remark 3. The proof of (c) is largely based on the fact that H is a complex space. 


Definition 2. A symmetric operator A ts called essentially self-adjoint if its closure 
A is self-adjoint. 

Remark 4. By part (a) of the theorem the definition is meaningful. In the later applications — 
to quantum mechanics we shall be interested in self-adjoint operators. On the other hand, by 
Remark 26.1.2/2 the operator A is known if A is known. Thus for physical applications it 
will suffice to know that A 1s essentially self-adjoint. 


26.1.5. Criteria for the Self-Adjointness of Operators 


The self-adjoint and the essentially self-adjoint operators take a eentral 
position in the theory of this chapter as well as in the following applications. 
The criteria to be described now will already show what remarkable properties 
are exhibited by self-adjoint operators. 


Theorem 1. Let A be a symmetric operator. 
(a) If D(A) =H, then A ts self-adjoint and bounded. 
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aaa ane 
(b) A ts self-adjoint cf there exists « compler number 1 such that R(A -1E)= 
= Rh(A -AE)=H. : 
(¢) A ts self-adjoint if and only if R(A—AF)=H for every complex number 2 
with Im 2+ 0. . 


Remark 1. Part (a) is a simple implication of 4 = A* and Theorem 26.1.1(a). 


Remark 2. Part (c) shows that for self-adjoint operators 4 the behaviour of A —ZE will be 
of special interest if 2 is real. As usual, £ is the identity operator. Part (c) follows from the 
inequality 

|4a—Ax]| =|Im A] ||zl]| forall xé€D(d). 
Remark 3. As previously, we shall also here consider the family of operators A —/E# rather 
than an individual operator A. In this connection the following statement is of interest. 


If 4 is a complex number and if A is a symmetric operator, then (A —AE)*=A*—7#, 
where D(A —AH#) = D(A). Further, as in Theorem 21.2.2, one has 


H=R(A—-i1E)@N(A*-1E), 
where the symbols have the same meaning as in Subsecs. 24.2.1. and 21.2.2. 


Theorem 2. Let A bea self-adjoint, and Ba symmetric operator, with D(B) > D(A). 
If there exist two real numbers c and 6 such thatc=0,0=6< 1, and if 


|| Bxl| SdAz||+e lle VeeD(A), 
then A+B, with D(A+B)=D(A), ts self-adjoint. 
Remark 4. This theorem is of interest especially in physical applications. It says that the 


self-adjointness of an operator A is stable under the action of small symmetric perturba- 
tions B, 


26.2. The Spectrum of Self-Adjoint Operators 
26.2.1. The Spectra Dy and G, 


The resolvent set and the spectrum of a bounded operator were considered in 
Subsec. 20.2.3. Let us now extend these investigations to unbounded operators 
in the Hilbert space (,,unbounded“ means »not necessarily bounded“; thus in 
particular a bounded operator is a special unbounded operator: this is mathe- 
matical scientese). As everywhere in this chapter, all operators A are linear, 
the domain of definition D(A) and the range f(A) lie in one and the same Hilbert 
space H. & denotes the identity operator. L(H) has the same meaning as in 
Subsec. 20.2.3. Finally, C, is the complex plane. 


Definition 1. [f A ts a linear operator whose domain of definition D(A) is dense in 
the Hilbert space H, then 


M,={4|2€C), 3 (A-AB)-! and (A —AE)-1€L(H)} 


vs culled the resolvent set (of A),and S,=C 1\41 4 ts called the spectrum (of A). Fur- 
ther we define 


Dy={A| 3e€D(A) with |xl=1 and Ar=Azr}, 
and CO ,={a| 3 (A —2E)—!, but does not belong to L({H)}. 


Remark 1. The first part is largely a repetition of Def. 20.2.3. As previously, (4 —Ak)~1 
denotes the inverse operator of A —AH, which exists if and only if A —/E provides a one to 
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one mapping of D(A)=D(A—AE) onto f(d —AE). It is easily seen that all possibilities 
are exhausted by M4, D4 and C4. In particular one has Sua DUC y eNs previously, 
D 4 is the set of all eigenvalues of the operator A. If 26), is an eigenvalue, then x€ D(A), 
with «+0 and Aa=Az, is called the (corresponding) eigenelement (eigenvector). One has 
De =e 

Remark 2, Thus the spectrum S 4 is subdivided into the eigenvalue spectrum D, and the 
residual spectrum C 4. This suggests itself, but is not convenient for the following consider- 
ations. As in Def. 21.2.1 the dimension of (A —AE) is called the multiplicity of the eigen- 
value A4€D 4. It is our aim to slightly modify the spectra D4 and C 4. Roughly speaking, we 
want to take the eigenvalues of infinite multiplicities out of D4 and assign them to € 4. For 
a precise formulation we will need the concept of a Weyl sequence. 


Definition 2. 7f A ts a linear operator whose domain of definition D(A) is dense in 
the Hilbert space H, then {x,}¢_,C D(A) ts called Weyl sequence for the point A if 
{tp }e-, ts bounded and not precompact, and tf Ar, —Ax,-O0 as k>+o~, 


Remark 3. Precompact sets were described in Def. 20.1.2/1(c). The prototype of a bounded 


non-precompact set in the Hilbert space is a orthonormal sequence {zz}¢-. The operator 
A—AE transforms “poor (Weyl) sequences” (from the convergence point of view) into 
“good sequences’. One can at least surmise that points 4 for which there exists a Wey] 
sequence are of special interest. 


Theorem. Let A be a self-adjoint operator. 
(a) Sy 7s a subset of the real numbers, and 


(A —AE)— Hs for Ima+0O. 


1 
[Im | 


(b) 4 belongs to D4 if and only if R(A —AE) is a proper subspace of H. 
) 4 belongs to C 4 if and only if R(A —AE)=H and R(A-AE)+H. 
) For every point A€C 4 there exists a Weyl sequence. 
(e) For every eigenvalue of infinite multiplicity there exists a Weyl sequence. 


Remark 4. Part (a) is an implication of Theorem 26.1.5/1 and Remark 26.1.5/2. Likewise, it: 


is easy to see that proposition (e) is true: every orthonormal sequence {tp};  N(A —AEZ) is 
a Weyl sequence for the point A. 


26.2.2. The Spectra D, and Cy 


By means of Weyl sequences it is now possible to repartition the spectrum of 
self-adjoint operators. 


Definition. /f A ts a self-adjoint operator, then 
D4g={Al2 is an eigenvalue of finite multiplicity} 

is called the discrete spectrum (point spectrum), and 
C,={4| 4 a Weyl sequence for a} 

ts called the continuous spectrum of A. 


Remark 1. By Theorem 26.2.1 we have C4CCy, and De 14, further the eigenvalues of 
infinite multiplicities belong to C4. Hence it is clear that Cs4UD,=S,CC,4UD,. The 
arising problem of whether C’, can also contain points of the resolvent set is answered nega- 
tively by the following theorem. 
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Theorem. Let A be a self-adjoint operator. 
(a) Sao AOE. 
(b) ff 4,6 Da, We Da, Agy=A\ry, Ary —AsG, ond Y Aye 45, ena) — 


Remark 2. This is the desired better partition of the spectrum S 4 (better for our purposes). 
Part (b) follows from 


Ay(%4, Xo) = (AX. ®y) = (44, Aw) =Ag(24, Xo) « 


Hence eigenclements corresponding to different eigenvalues are always orthogonal. 


26.2.3. Compact Self-Adjoint Operators 


The theory of Riesz and Schauder presented in Sec. 21.2. shows that com- 
pact operators are of special importance. On the other hand it is shown by the 
preceding investigations that self-adjoint operators also play a special role. 

Thus it is clear that self-adjoint compact operators will play quite an extra- 
ordinary role. L(#) shall have the same meaning as in Subsec. 20.2.3. 


Theorem 1. /f A ¢ L(A) ts self-adjoint, then S4<[—||All, ||All], where at least one of 
the two potuts ||A|| and —|\Al| belongs to S 4. 


Remark 1. 84 c[—||Al|, |All] follows immediately from Theorem 26.2.1(a) and Theorem 
20.2.3. What is new in this theorem is the statement that either ||.A|| or —||A|| belong to the 
spectrum. 


Theorem 2. Let A €L(#) be self-adjoint and compact. 

(a) S4c[—|lA]], |All] and C4 = {0}. 

(b) D4 consists of an at most countably infinite number of different eigenvalues, 
which can accumulate only at the point 0. Every eigenvalue that is different from 0 
vs of finite multiplreay. 

(¢) There exists an orthonormal system {x,} cH such that Ax, =A,2, and 


Ag Age) ep far every elle (1) 


Remark 2. In the theorem it has been assumed that H is infinite-dimensional (otherwise 
C'4=0). The eigenvalues can be ordered by their absolute values, taking into account their 
multiplicities: |A,| = |Ag| =As| = ..., where A, +0 (in case there are infinitely many non-zero 
eigenvalues; Fig. 26.1). For a fixed eigenvaluc A, the characteristic manifold N(A—AE) 
can be spanned by an orthonormal system. The class of all these systems then forms the 
system {x;} in part (c). (By Theorem 26.2.2(b), eigenelements corresponding to different 
eigenvalues are always orthogonal). The sum in (1) needs only to be taken over those k for 
which A, +0. 


Theorem 3. A self-adjoint operator A € L(H) ts compact zf and ouly if C 4 = {0}. 
Remark 3. It has again been assumed that H is infinite-dimensional. 
Ay ay Ay 
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26.3. Spectral Families 
26.3.1. Definitions 


Definition. A family (Fac R, Of projectors tn the Hilbert space H 7s called a spectral 
family if it has the following properties: 
(a) For all x¢H, jim E,2=0 and dim [Dippiee re. 


(b) For all x€H and all we Ry, lim Dee Dee 
7] 
(ce) For all A4€ R, and we Ry, HE =H 


min(A,)° 


Remark 1. Projectors (or projection operators) were described in Subsec. 21.1.5. The con- 
vergences in (a) and (b) are of course to be understood in terms of the norm in H. As pre- 
viously. 4 t 4 means that A+ and A<y. By analogy we write A { w for Au and A>up. 
From (c) it follows that HiE,,=H,E;. Further E}=H,, which, by Theorem 21.1.5, is a 
necessary property of projectors. The same theorem shows that projectors are self-adjoint. 


Lenuna. Porallxr€H andall we Ry, Le Lx=E 9x exists. Here E,.9 is @ pro- 
jector. 


Remark 2. According to Remark 21.1.5/2, H*={x | x€H, E,a=2} is the projection space. 
corresponding to £;. By analogy we define H4+9 = {x | x¢€H, Ey, 9% =a}. 


Remark 3. The properties (a) and (c) in the above definition are essential, while (b) and the 
lemma are normalizations. 


26.3.2. Properties 


All notations to be used shall have the same meaning as above. 
Theorem. (a) For —o<Asy<o, H*cH"cH*"* and UH*=H. Further E.—-#E, 
a 


is the projector with the projection space H“OH*, and EB,,)—E, ts the projector 
with the projection space H"*° 6H’. , 

(b) For —0o<A<p<o, H**°CH", und E,,— Ey. ts the projector with the projec- 
tion space H" © H?*, 
Remark 1. (/“©H? is the space which complements H? orthogonally to H“ (Fig. 26.2) 


Hence, in terms of Subsce. 17.3.5. one has Hes H’@(H, OH’). An analogous relation holds 
for H4+06 74, 


Remark 2. Thus //4 is a monotonically increasing sequence of Hilbert spaces which exhaust 
the total Hilbert space //] as A: UH? is dense in I. 
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26.4. Spectral Operators 
26.4.1. Riemann-Stieltjes Integrals for Functions 


Before considering Riemann-Stieltjes integrals for spectral families in thefollow- 
ing subsections, it is useful to study corresponding integrals for functions. If 
h(t) is a real-valued increasing (non-decreasing) left-continuous function on FR, 
(Fig. 26.3), then h(t) generates a Borel measure dh(t) in the sense of Subsec. 
13.2.4. If ¢(¢) is a real-valued continuous function on Ry, and if —o<a=b<~o, 
then according to Chap. 14 we can form the integral f g(t) dh(t). Here {a, 5} 

{a,b} 


means (a,b) or [a@, 6) or (a, d] or la, b] Ge, 6} ={a} is also admitted). If y(¢) is 
complex-valued, then of course we set 


f p(t) dh(t)=f Re y(t) dh(t)+if Im ¢(é) dh(d) . 
[ao 


Fig. 26.3 
But this special case of the general integration theory can also be treated in 
terms of the Riemann integral discussed in Sec. 3.2. Let a=ag<a,<...<a,< 
—G,.—0 be the partitionsot 03.2511) eandelemd(7)— mia slay aa el 
t-€[4,,d¢44], then ieee 

g(t) dh(t = Gm 2 pty) (ees) —hlaz)) . (1) 

[4,b) (2) >0 xa 

It {a, b}=(a, b), then one has to replace h(a) in (1) by h(ag +0). Lf {a, 6} = (a, d], 
then in (1) we have to replace h(ay) by h(ay+9) and h(a,.,) by h(a,.,+0). Hf 
{a, b}=[a, 6], then in (1) we have to replace h(a,,,) by h(a,,1+0). (1) and the 
modifications indicated are the analogue to Lemma 3.2.1/2. The formulation 
also makes it clear that (1) is independent of the particular choice of the ¢,. If 
Jr t) dh(t) exists (in the sense of the integration theory described in Chap. 14) 


a it is clear that 


felt) da(t)= lim f(t) dh(t). (Dy 
Ti AV 00 (Sry 


-—WV,2 


Complex distribution functions: If h(t)=h,(t)—ho(t)+ih3(t)—ih,(t), where the 
h,(t) are real-valued increasing left-continuous functions, then one sets 


eee TS dh (d) 5 dho(?) (3) 
+i f p(t) dhs(t)—i f p(t) dh,(t) - 
Rt hy 


Here ¢(/) 1s a complex-valued continuous function on R, for which all integrals 
on the nght-hand side of (3) exist. It is clear that f (¢) dh(¢) has the usual proper- 
ties of integrals. 5) 
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Examples. Let {£,)}.cp, be a spectral family. Then h(t) =||#,x||2, with rei, is a 
real-valued increasing left-continuous function. If x¢H and yeu, then (= 
=(E,x, y) is a complex distribution function in the above sense. This follows 
from 


re Pte || lee gi? ||. rtiyl|e . x— iy ||? 
(Ex, y)=|| By — —|| B, sel + EK, 5 | - H, ! 


Thus we can consider Riemann-Stieltjes integrals of the form 
SPU) AFE,x|? and  f g(t) d(E,x, y) . 
R, Ri 


26.4.2. Riemann-Stieltjes Integrals for Spectral Families on Finite Intervals 


Our aim is to replace the function A(t) referred to in the preceding subsection 
by the function E,.x, whose range is a Hilbert space. Here {E ter, is a spectral 
family, and x¢H. The symbols to be used shall have the same meaning as in the 
preceding subsection, i.e. -»<a=b<o, the partition a=ay<a,<... ete eng 0) 
the points ¢,, and d(Z). 


Theorem. Let {E,jier, be a spectral family in the Hilbert space H. If p(t) is a com- 
plex-valued continuous function on Ry, then, for every x€H, 


Rn 
h GE 2 gee) t) dE,x 1 
d(Z)~0 = P(t) ( ap, i a) a ) dk, (1) 
exists as a limittn H. The limit element belongs to the projection space Pi B, = E., 
and 


Ife d£2\2= f lp@led | £22. (2) 
{a,b) {a,b) 
Lf «,€H and xo€H, and if 0, and 05 are complex numbers, then 
S p(t) AE, (or; + 00%) =0, f p(t) dE, t+o2 f p(t) dE yx . (3) 
[4,b) [a,b) [4,b) 


Remiark 1. It is clear how one has to understand (1): the left-hand side of (1) exists as a limit 
in H, which is independent of the particular choice of the partitions and independent of 
the points t,. The right-hand side of (1) 1s then the definition of a Riemann-Stieltjes integral 
whose values are elements of a Hilbert space, by analogy with (26.4.1/1). The fact that 
this integral belongs to the projection space of #, —£, is verified rather easily. The right- 
hand side of (2) is an ordinary Riemann-Stieltjes integral for functions in the sense of 
Subsec. 26.4.1. 


Remark 2. By analogy with Subsec. 26.4.1. one may introduce corresponding integrals 
f p(t) dfx for the other interval types. If one replaces L,,” in (1) by Hy,,0%, then one 


{a,b} 
obtains f p(t) d#,x. Further, 


(a,b) 
f PO) dBe= Sf olt) ye +(b) (Bry Hy), 
(a,b] (a,b) 
f plt)dBe= f plt) dE + eb) (By,o— Bi)a- 
[a,b] (a,b) 


(2) and (3) remain true if [a, 6) is replaced by {a, b}. Further the integrals belong to the 
corresponding projection spaces, €.g., 
f p(t) diac WH’ oH? , 


(a,6) 
Sf pt) dLx= (a) (Hey jecH? "Ol ‘ 
{a} 
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26.4.3. Riemann-Stieltjes Integrals for Speetral Families on R, 


By analogy with (26.4.1/2) let us extend the considerations of the preceding 
subsection to Rk, As usual, H is a Hilbert space. 


Theorem. Let {E,}icr, be @ spectral family, let p(t) be a complex-valued continuous 
function on Ry, and let 


M={ylyéH, fle@led|Eyl2<-}. (1) 
Ry, 


(a) M as a linear set dense in H. 
(b) lim ff g(t) dk,x= fot) dH,r 2 
N= (_ HN) Ry 
exists tf and only af xe M. 
(e) For xe M, 
Fi p(t) ABa\?= fip(t)? d ||E,2|? . (3) 


*) 


ei) 


Remark. In (2) it is possible to replace lim j) bye Shwe ff, The i leew heme iis 
; oy 2 ee 


right-hand side of (2) is the definition of the corresponding integral, provided that the 
left-hand side of (2) exists. 


26.4.4. Spectral Operators 


Definition. 7f {Lier ts a spectral family, and tf (t) is a complex-valued continu- 
ous function on Ry, then A, given by 
Aa fod a 
qty 
uith the domain of definition 


D(A) = {x if p(t)? d || B,x|?< =} 


as a spectral operator. 


Remark 1, From Theorem 26.4.3 and from (26.4.2/3) it is easily concluded that A is a 
linear operator whose domain of definition D(A) is dense in H. This raises two questions: (1) 
What are the properties of spectral operators? (2) How general is this concept, which oper- 
ators can be represented as spectral operators? 


Theorem 1, Let A be a spectral operator with respect to the spectral famil y {Evter, 
and the complex-valued continuous function @(t). 

(a) A ts a closed operator with a dense domain of definition D(A). 

(b) The adjoint operator A* is also a spectral operator, and 


A*x= fp(t)dE,x and D(A*)=D(A). (1) 
R, 
(¢) For «€ D(A) and yéeH, 
(Ax, y)= i g(t) d( Ey, y) . (2) 


(d) A as self-adjoint if and only tf y(t) is real-valued. 
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Remark 2. Since D(A) is dense in the Hilbert space I, by Subsec. 26.1.3. it is possible to 
form the adjoint operator A*. Then (d) follows from (1). By Subsec. 26.4.1. formula (2) is 
meaningful, where the formulation in part (c) is meant to include the convergence of the 
integral in (2). 


Theorem 2. If {E,icr, ts a spectral family, and af p(t) and p(t) are any two contin- 
uous functions on R,, then let 


Ar= ftdEx,  x€D(A), (3) 
Ry 
g(A) r= a q(t) dE,a, x€D(y(A)) , (4) 


with an analogous formula assumed for y(A). Then 


¢(A) + y(A) = y(A) - @(A)= (vy) (A). (5) 
Further ¢(A)=A" ¢f o(t)=t", n=1, 2, 3, ... 
Remark 3. By Theorem 1, 4 is self-adjoint. (4) then says that we can form functions of 
self-adjoint spectral operators. (5) shows that operator functions can be handled in calcula- 
tions like ordinary functions. Here the domain of definition, 

D(¢(A) y(4)) = {a | xED(y(A)),  y(A)x€D(g(A))} 
is the set of all elements x€H on which (A) (y(A) x) can be carried out. Then g(A) y(A) is 


a closable operator, and g(A)y(A) is the closure. The last assertion of the theorem shows 
that this construction is compatible with the formation of the product, A2= 4A, A" =AA”-!, 


26.4.5. The Fundamental Theorem of Spectral Theory 


Theorem. Every self-adjoint operator A in the Hilbert space H, with the domain of 
definition D(A), has a unique spectral family {E,}1¢n, such that A and D(A) can be 
represented as 


Axv= fidE,r, D(A)={rlxreH, fedlEa2<—}. 
Ry, Ry 


Remark 1. This is one of the most fundamental and most beautiful theorems of operator 
theory. It shows that (26.4.4/3) does not describe a narrow subset of the self-adjoint oper- 
ators (as might be guessed at a first glance), but rather represents the most general self- 
adjoint operator. Thus in particular we can form operator functions g(A) of arbitrary self- 
adjoint operators A. sin A, |A], or )'A] are meaningful, because sin ¢, {t], and V|t] are ad- 
missible functions. 

Remark 2. The proofs for the theorems in Subsec. 26.4.4. are not trivial, but yet compara- 
tively simple. On the other hand a complete proof of the above theorem is highly intricate. 


26.4.6. The Spectrum of Self-Adjoint Operators 


In Subsee. 26.2.2. we eonsidered the spectra Dy, C, and S, of a self-adjoint 
operator A. Now we think of A as being represented in the form of Theorem 
26.4.5. The spaces H’ and H’*® shall have the same meaning as in Subsec. 26.3.1. 
with respeet to the speetral family {L,}:er,, which, by Theorem 26.4.5, is uniquely 
determined by A. 
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Theorem. (a) For every real number 2, N(A—AF)=H°" OH’. 
(b) The real number 2 belongs to C4 if and only if H**® OH? * is infinite-dimen- 
sional for every number e=0. 


Remark. If / is an eigenvalue, then //4+ © H? is the corresponding characteristic manifold, 
i.e., the set of all eigenelements for the cigenvalue A and the zero element. A€ D4 is true if and 
only if 144° @H?* is finite-dimensional but consists not only of the zero element. The 
theorem thus allows a remarkable characterization of the spectra D4 and C4 by properties 
of the corresponding spectral family in the sense of Theorem 26.4.5. This shows the close 
relation between spectrum and spectral family. 


26.4.7. Operators with a Pure Point Spectrum 


The spectra D4 and C, fora self-adjoint operator A shall have the same mean- 
ing as in Subsece. 26.2.2. 


Definition. A self-adjoint operator A is called operator with a pure point spectrum tf 
C4 = 0. 


Remark 1. Thus, by Theorem 26.2.2, the spectrum S 4=D 4 consists only of eigenvalues of 
finite multiplicities. Such operators play an important role in physical applications. 


Theorem 1. An operator with a pure point spectrum is not bounded. 


Remark 2. Hence a bounded self-adjoint operator has a nonvoid continuous spectrum C 4. 
As previously, here we have again tacitly assumed that #7 is infinite-dimensional. 


Theorem 2. Let A be an operator with a pure point spectrum. Then A has u count- 
ably infinite number of different eigenvulues of finite multiplicities, which cannot 
accumulate in finiteness. Further there exists a complete orthonormal system (x, }?_ ne 
eA sucha! 7, J, 27, 


D(A)= v | fei, a Ii I(x, tk) [2 < 00 > (1) 
= : 
Ara > Aglas @,) ty fOr ce DAE ; (2) 


Remark 8. This is the analogue to Theorem 26.2.3/2. As we did there, we have assumed that 
H is infinite-dimensional. If the eigenvalues are ordered, taking into account their multi- 
plicities, then one obtains 0=||A,=|A9| = ..., with |A,|-~o as ko. In every characteristic 
manifold one can freely choose an orthonormal system. The set of all these systems then 
forms the system {x;} referred to in the theorem. (By Theorem 26.2.2(b) the eigenelements 
corresponding to different eigenvalues are always orthogonal). 


Remark 4. The representation (1), (2) is the concrete formulation of Theorem 26.4.5, repre- 
senting the infinite-dimensional variant of the well-known transformation to principal axes 
of surfaces of second order in analytical geometry. Theorem 26.4.5 then is the continuous 
analogue. 
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27.1.1. Introduetory Remark 


The spectral theory presented in Chap. 26, and especially Theorem 26.4.7/2 
and Remark 26.4.7/3, van be used effectively in studying ordinary and partial 
differential operators. The method can be outlined as follows. One considers a 
differeutial expression, e.g.. 4y= — y” in the interval (a,b), where —o<a<h<«. 
Now one looks for suitable domains of definition D(A), such that A becomes es- 
sentially self-adjoint (cf. Def. 26.1.4/2) in the Hilbert space L,(a, b). Then the 
closure A is self-adjoint, and the theory of Chap. 26 is applicable to A. What is 
aiined at is that 4 is an operator with a pure point spectrum in the sense of Def. 
26.4.7. Then it is possible to make use of Theorem 26.4.7/2 and Remark 26.4.7/3. 
We attempt an explicit determination of the eigenvalues and eigenelements 
(eigenvectors) of A. Then it follows from Subsec. 26.4.7. that the corresponding 
orthonormal system of eigenelements is complete in the Hilbert space concerned. 
Subsequently it is possible to expand arbitrary elements of the Hilbert space 
considered in terms of such systems. Thus one has a situation as described in 
Chap. 18 for n-dimensional trigonometric series and Legendre polynomials. The 
usefulness of such orthogonal expansions can be evaluated from, say, Sec. 19.5., 
where separation setups for partial differential equations have been treated on 
this basis. For us, however, the considerations to be presented in this chapter are 
also intended as a preparation for the following chapter on quantum mechanics. 


27.1.2. Trigonometric Functions 


In this subsection we take as a basis the (complex) Hilbert space H = L,(Q), 
with Q=(a, b), —o<a<b<~, in the sense of Subsee. 17.3.2. Instead of L,((a, b)) 
we use the simpler notation L.(a, 6b). By C~([a, b}]) we denote the class of all 
com plex-valued functions differentiable up to arbitrary order on (a, b), all deriv- 
atives of which can be continued coutinuously to [a, b]. 


Theorem. (a) The operator (A pf)(x) = —f’' (x) with the domain of definition 
D(Ap)=tf | fee>([a, b)), a) = f(b) =9} 
ts essentially self-adjoint in Lo(a, b). Ap is au operator with a pure point spectrum. 


ae e mt 
The eigenvalues are A= , with k=1, 2,... These eagénvalues are 


(aa) 
2 r—a ‘ 
simple ones, and fule)=| b sin wk are the corresponding orthonor- 


‘ 5 —a b—« 
mal eigenfunctions. _ 4 
(b) The operator (Ayf)(x)= — f(x) with the domain of definition 


D(Aw)={f | feCr(Le, b)), fe) = f(b) = 05 


Be ollie 
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is essentially self-adjoint in Lo(a, b). Ay is an operator with a pure point spectrum. 


279 
Teak? ; 
The etgenvalues are Sane , with k=0,1,2,... These eigenvalues are 
. —ii = 
i) Z _2-a : 
simple ones, and fo(x)=———= as well as f(x) = |/—— cos —— al, with 
b—a b-a —a 
(=1, 2, ..., are the corresponding orthonormal ergenfunctions. 


Remark. As was already mentioned in Subsec. 27.1.1., it follows from this theorem as well 
as from the considerations in Subsec. 26.4.7. that both 


(2a ee 4 1 = oe 
| ssn - an — bua bay - 


are complete orthonormal systems in L,(a, b). On the basis of the spectral theory of self- 
adjoint operators, for the one-dimensional case we have thus obtained the same result as in 
Theorem 18.1.4 and in Remark 18.1.4/1. These considerations can also be extended to the 


n-dimensional case. 


27.1.3. Hermite’s Functions 


Let Cy(#,) = D(R,) be the class of all complex functions with bounded sup- 
port that are differentiable up to arbitrary order on R,; cf. Def. 14.6.4/2 and 
Subsec. 22.1.2. 


Theorem. The Hermitian differential operator (Af)(x) = —f’’ (x) +xf(a), with the 
domain of definition D(A) =OF(R,), ts essentially self-adjoint in Lo(R,). A ts an 
operator with a pure point spectrum. The ecgenvalues are A, =2k +1, withk=0, 1, 2, ... 
These eigenvalues are simple ones, and with suttably chosen positive constants Cz, 
te 
ea! —a2 

A(x) =cye dax* ae) 

are the corresponding orthonormal eigenfunctions. 


Remark 1. The first part of the theorem can be essentially generalized: if p(x) is a real- 
valued continuous function on R,, such that p(x) > as |x| >, then 


(Af)(x) = — f(x) +p(a) fz), D(A) =CZ(R,) , 


is essentially self-adjoint in Lo(R,). Further, 4 is an operator with a pure point spectrum in 
this case, too. 


Remark 2. The considerations in Subsec. 26.4.7. show that {H;,(x)};-.9 is a complete ortho- 
normal system in Lo(,). Thus it is possible to represent every function {(x) €L,(R,) as 


f(x) - 2, (f, He) z.(R,) Hy() « 
If A is not an eigenvalue, then the (generalized) Hermite differential equation Ag—jg=f 
has the unique solution 


me: Sy 2a) 
es Oe 


H;,(x) 6 
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Remark 3. It is immediately seen that 
a dé 
hy(a)=—e2 H,(a) =e?’ — (e-@? 
tila) = =e? Hala) =e* = (e-*) 
where k=0, 1, 2. .... is a polynomial of degree k. These are the well-known Hermite poly- 
nomials. 


27.1.4. Legendre’s Functions 


In Subsec. 18.2.2. we already investigated Legendre polynomials. On the 
basis of the spectral theory of self-adjoint operators we shall now essentially ex- 
tend the considerations described there. 


Theorem t. The Legendre differential operator (Af)(x) = —((1—2?) f’(x))’, with 
the domain of definition D(A) =C=([ —1, 1)), is essentially self-adjoint in Ly( —1, 1). 
al is an operator with a pure point spectrum. The eigenvalues are Ay =k(k +1), with 
k=0, 1, 2, ... These eigenvalues are simple ones, and the Legendre polynomials 


2k+1.,1 a’ an 
Iy(a) = / 2 Plage reel (1) 
are the corresponding orthonormal eigenfunctions. 


Remark 1. The notations C~({—1, 1]) and L,(—1, 1) have the same meaning as in Subsec. 
27.1.2. Apart from normalizing factors, (1) agrees with (18.2.2/1). Again it follows from 


Subsec. 26.4.7. that {L;(x)};9 is complete in Z.(—1,1). Thus an arbitrary function 
f(x) €Lo(—1, 1) can be expanded in terms of Legendre polynomials, as was stated in Remark 


ode ae | ae 


Theorem 2. For m=0, 1, 2, ..., the differential operator 


(Anf)(2) = —((1=22) f') +5 fla), with 


D(Am)={f | f(x) =(1—22)? P(x), P(x) polynomial} 
as a domain of definition, is essentially self-adjoint in L(—1, 1). Ap, is an opera- 
tor with a pure point spectrum. The eigenvalues are A, =k(k +1), with k=m,m-+1, 
m+2,... These eigenvalues are simple ones, and 

k—m)! = d™ 

= as 1— x2)? — I(x 

LE) = | ey 1)” gow Pale) 
are the corresponding orthonormal eigenfunctions. 
Li(x) are called associated Legendre functions. Again the completeness of the system 


{Li (x)}z—m in Lo(—1, 1) is deduced from Subsec. 26.4.7. 


Remark 2. For m= 0 one largely obtains Theorem 1. As usual we have set 0!=1 (in (1) too). 


27.1.5. Laguerre’s Functions 


By O=([0, <)) we denote the class of all complex-valued functions f(x) differ- 
entiable up to arbitrary order on (0, -), which vanish for large values of x and 
all of whose derivatives can be continued continuously to the point 0. 
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Theorem. For «> —1, the Laquerre differential operator 


a : 

(Af) (2) = —A( xf’ (w))’ + (+=) fle), with 

D(A,)={f |x * f(x)EC=([0, -))} 
as a domain of definition, ts essentially self-adjoint in the Hilbert space L(0, co). A, 
is an operator utth a pure point spectrum. The eigenvalues are Ay =2(2k+1+2«), 
with k=0, 1, 2,... These eigenvalues are siniple ones, and 

v a k 

>= -~d 
= Ope 8 
( ) kya da? 
with a surtable choice of the positive constants Cra, are the corresponding orthonormal 
eigenfunctions. 


L 


-2, firey 


(em 


ake 


Remark 1. Z, (x) are the Laguerre functions. Again it follows from Subsec. 26.4.7. that 


{L,,4(%)};0 is a complete orthonormal system in L,(0, «). In particular it is possible to 
expand every function f(x) €L,(0, -) in terms of Laguerre functions, that means, 


fla) = Be Lap) 1m) Eas 


Remark 2. It is immediately seen that 
Teac HE dt 

® *@2lh, AaSeu 
afi ) dak 


Ly a(@) = (Capra) 


Chia 
are polynomials, which are called Laguerre polynomials. 


Remark 3. The above theorem was established by U. Griinewald (dissertation, Jena, 1975). 


27.2. Surface Harmonies 
27.2.1. Beltrami’s Differential Operator 


n 2 
The Laplace operator A= >, i 

pas 
gated in detail in Chaps. 19 and 23. Later on, in Chap. 33, we shall consider the 
Laplacian and the wave operator in general curved spaces. At present we are 
interested in the analogue to A on the unit sphere in R,, i.e., on w,={2 | || =1} 
(Fig. 27.1). Since we do not intend to use the (rather complicated) techniques to 
be introduced later on, we content ourselves with a specific construction. Let 9 be 
the general point on w,, and let C=(w,) be the class of all complex-valued func- 


in fk, and in domains Qc R,, was investi- 


oY 9 y ° 
EL, 27.2. Surface Harmonics AST 


tions that are differentiable on , up to arbitrary order. If ds is the surface ele- 
ment on @, in the sense of Subsec. 9.2.5., then 


(1 Dixwy= SH) g)ds, fEO(o,), fEC™(an), (1) 
: Op, 

is a scalar product. Let Lo(w,) be the Hilbert space obtained by completion of 

C*(@,) with respect to the norm corresponding to (1). This space can also be de- 

scribed as follows. Let r=|x| be the distance of wé R,, from the point 0. Then let 


Oye {« | —< r=3] . Now it is possible to identify r¢Q uniquely by r and 9¢a,, 


«= (r, 3). We now continue f¢€O*(o,) continuously to Q by setting fo(r) =/(9) 
(this means that fo(x) is constant along the radii from the origin). Now it is pos- 
sible to regard Lo(w,) as a completion of {fo(x) with f(9) €O*(w,)} in L5(Q), ie., as 
a subspace of L,(Q). Such a point of view (though possibly not quite satisfactory 
from the logical aspect) will suffice for our purposes. ; 
Definition. (Bf)(9)= —r?Afo(x), with fEO*(w,), 7s called Beltrami’s (second) 
differential operator, n=2. 
Remark 1. If fEO-(a,), then fo(x) with «€Q has the above meaning. Then 
n Bo 

(2) 


—r?A =—(x?+ ... +03 
r2Afo(a) (x; +24) 2 jin? 


is a function differentiable up to arbitrary order on Q. If we again set «=(7, 0), then it 
turns out that this function does not depend on 1, i.e., that it is constant along the radii 
from the origin. According to the above identification one then has —r2Afo(2)€C™(w,). 
It is in this sense that the above definition has to be understood. One may ask for an 
intrinsic description of B, which does not make use of the continuation from w, to Q. 
Systematic considerations of this kind are described in Chap. 33. On the other hand it is 
possible to describe 6 explicitly by means of n-dimensional polar co-ordinates; cf. [66], 
p. 418. In the following remark we confine ourselves to the cases n =2 and 2n=3. 


Remark 2. In the case n=2 we identify the points 0 €w, in the usual way by the angle & 
(Fig. 27.2). Here 0=%<2z7. With respect to the polar co-ordinates (7, 4) one obtains 


abe 


i taf odu\ 10% 
Me) = ae" Or) T9298?” 
ne ep 
Tein, 27 2 
ga} 2 1 /eEc~ In tl =3 we use the polar co-ordi- 
Hence (£/)(9) = — 555 for n=2 and f€C™(m5). In the case n=3 we us P 


nates defined in Subsec. 12.4.1., which gives 


1 ju { Ay 
Au(x) = sa ( a =) a (sin o -) “ ae 


Par \" dr)? sin 8 00 90) Tsim? Op? 
Hence “ 
il ees ae 
(Bf)(d, v) = = aa Op (sin o +a) — sin? Dope (2) 


for n=3 and f€C*(w3). The disadvantage of (2) is that the polar co-ordinate system (r, 0, 7) 
becomes singular for #=0 and d=7. 


wo 
~ 
Xe) 
(ea) 


288 - 27. Differential Operators 


27.2.2. Surface Harmonies as Eigenfunctions 


In Remark 23.1.2/2 we described harmonic polynomials P(x) in R,. A poly- 


nomial is called homogeneous if P(x) = >, 0.x", where, ‘as previously, x= 
[<j=m 


=%,'..2,". Such a homogeneous polynomial can be written as P(x) =r™S(8), 
where S(#)€C"(w,). We shall retain all the notations of Subsec. 27.2.1. 


Definition. Zf k=0, 1, 2,..., then S(#)€O7(@,) ts called surface harmonic of 
degree k if r*8(9) = P(x) is a homogeneous harmonic polynomial in R,,. 


ab 
Remark t. S()=1 is a surface harmonic of degree zero, S()=— is one of degree one, 


Boy 

S(d) = —,— Is one of degree two, etc. 
r 

Theorem. Beltrami’s differential operator Bas per Def. 27.2.1 ws essentially self- 

adjoint in the Hilbert space Ly(w,). B is an operator with a pure point spectrum. 

The different eigenvalues are 4, =k(k +n—2), with k=0, 1, 2, ... The multiplicity 

of the eigenvalue A, is V,(k) = (a aa | — es 2 


teristic manifold consists of the surface harmonics of degree k. 


. The corresponding charac- 


Remark 2. Here ()=0 if m<k. From the theorem it follows that there are V (A) linearly 
independent homogeneous harmonic polynomials of degree k in R,. Further, the charac- 
teristic manifold corresponding to the eigenvalue J, can be spanned by V,(k) orthonormal 


surface harmonics S¢ (9). Since eigenfunctions corresponding to different eigenvalues are 


always orthogonal, it then follows from Subsec. 26.4.7. that pee isa complete 
Pte Ueal(S)) 


orthonormal system in the space Lo(w,). An arbitrary function /(#) € L5(w,) can thus be ex- 
panded in L(w,) in terms of surface harmonics, 
Se, (k) 
HP)= ZB (SP?) to(wySe(8)- 
ses p=t 
Remark 3. Of special interest are the cases n =2 and n—3. One has V,(0)=1 and V.(k) =2 
for k=1, 2, 3, ... Further, V3(k)=2k+1. The case n=3 will be dealt with in greater detail 
in the next subsection. For n=2 it follows from Remark 27.2.1/2 that (BAP) = —f’(9). 
In this case the eigenvalues are 4, = k2, where k=0, 1, 2,... The corresponding orthononormal 


1 ; : 

eigenfunctions are ~—e+ik? (for k=1, 2,3, ... one has two functions each, and one func- 
tion for k=0). Hence 

as , obs et, f. eniko|” 

yom Vis Vin os 
is a complete orthonormal system in the Hilbert space Ly() = £,(0, 27). This agrees with 
Remark 18.1.1/4 and Theorem 18.1.3. Thus, in the case n=2, rket*0=(x,+ix)* are 
harmonic polynomials, provided that the representation in polar co-ordinates, X,=?7 cos B, 
X= sin , is used. 
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27.2.3. Three-Dimensional Surface Hannuonies 


We uow discuss the case n =3. As in Remark 27.2.1/2, let (7, 8, ») be the polar 
co-ordinates referred to in Subsee. 12.4.1., O<r<o, 0<#<n, 0=9<2n. The 
Beltrami operator B now has the same meaning as in (27.2.1/1). Finally, we re- 
eall the associated Legendre functions L(x) referred to in Subsec. 27.1.4. 


Theorem. {S{(8, P) be=0,1. 2,00 , given by 
m=—k,—k+1,.0.0,...46 —18 


SO(8, ¢)=—= Liml(cos 9) el” , (1) 


Sj 
aie 


is a complete orthonormal system of three-dimensional surface harmonics. 


Remark 1. Here Sg BS Ge . 18 an orthonormal system of 2k-+-1 three-di- 
P) fma—k,...,0,..5% ny 

mensional surface harmonics of degree /. Every other surface harmonic of degree k can be 
represented as a linear combination of these 2k+1 surface harmonies of degree k. Then it 
follows fron Theorem 27.2.2 that these surface harmonics are the eigenfunctions of the three- 
dimensional Beltrami operator in (27.2.1/1). 

Remark 2. As was already pointed out repeatedly, every function /(9, ~) €L5(w3) can be 
expanded in terms of the system cSt Further it can be observed from the above consid- 
erations that ks 9. y) are homogeneous harmonic polynomials of degree k in R3, 


...1t is this as well as the numerous other ap- 
plicationsfrom which matheimatics derives what- 
ever reputation it enjoys in the wider public. 
(Hilbert, Naturerkennen und Logik, 1930) 


28. Principles of Quantum Mechanics 


28.1. Axiomatics of Quantum Mechanics 


28.1.1. The Hilbert Space Model 


In Sec. 12.1. we discussed mathematical models of physieal theorics. Iu the 
case of quantum mechanies, the schematic representation of Subsec. 12.1.2. can 
be reified as follows, where the numbers indieate the sections in which the subjects 
in question are dealt with (Fig. 28.1). In this section we shall consider the cor- 
responding mathematical theory, referring to the spectral theory of self-adjoint 
operators of Chap. 26. As in this theory, we also here assume // to be always a 
(complex separable) infinite-dimensional Hilbert space. 
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quantization — quantum 
763. mechanics 


Hilbert space 
model , 28.1. 


interpretation | 28.2. 
Fig. 28.1 


Definition, A quantum mechanical system © ={K, VY} consists of a self-adjoint 
operator Min H and a one-dimensional subspace VY in H. 3 is called the Hamil- 
tontan operator of S, and ¥ the state of S. 


Remark 1. What we are considering now is the definition of mathematical concepts which, 
for the time being, have no physical content. The wording, however, already indicates later 
physical interpretations. 

Remark 2. A one-dimensional subspace WY of Hf consists of all elements Ay, where y¢H, 
with ||y]|=1, is a fixed element and A is an arbitrary complex number. Here wp is unique ex- 
cept for a complex factor of absolute value 1. In this sense, from now on we shall also call 
S={#, y}, with |yl|=1, a quantum mechanical system, and y the corresponding state. 
Thus y may also be replaced by peig, where 9 is a real number. Consequently, all the concepts 
to be defined must be invariant under the substitution y —yeie. 


Now I know for certain that the quantum of action 
has a much more fundamental significance than I 
originally suspected. 

(M. Planck, at the end of his life) 


28.1.2. The Dynaimies of Quantum Mechanical Systems 


We consider a quantum mechanical system that depends on time (more pre- 
cisely, on a real parameter that we shall call time), G(¢) = {3f, p(#)}. Here the 
Hamiltonian operator 3((¢) =9 is assumed to be independent of time, so that it is 
only the state y(t) which is variable. In the sense of Remark 28.1.1/2 one has 
\|p(t)|| = 1. We shall now establish by axiom what y(é) will be for ¢=0 if y(0) and && 


h : 

are known. Here eee h being Planck’s quantum of action, h=6.62 - 10-27 
TG 

erg - sec. !) ‘ 


Axiom. The quantum mechanical system © = {H, p} at the time t=0 transforms into 
t 


: Ae . —i—it A 
the quantum mechanical system S(t) = {, y(t}, with yi)=e  wpatthetinet=0. 
=i g ria 
Remark 1. Heree 4 has the meaning indicated in Theorem 26.4.5 and Theorem 26.4.4/2, 
' t 
r ee ; y , =i ioe 

with d= and p(t)=e 4. According to Subsec. 26.4.4. and Theorem 26.4.3,e Nn isa 
unitary operator in the Hilbert space /Z. Hence one has {hw(é)|| =I}y|| =1. In view of Remark 
28.1.1/2, the axiom is meaningful. It describes the dynamies of the system G: the variation 
of the state y(t) when the Hamiltonian operator 3 is constant. 

‘) KB. Segré writes in [57, pp. 61, 87]: “It often seems as if physics follows a predestined 
line and that great scientists have simply accelerated progress. If one scientist had not been 
there, another would soon have taken his place and found the same thing. One important 
exception to this is the discovery of the quantum of action... The whole tone of Planck’ s 
first. and even later, work gives the impression that quantization was little more to him 
than a caleulational device.” 
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Remark 2 (Schrédinger’s equation). By analogy with R, and Cj, it is possible to introduce 
a concept of deni anes in LF. {p(t)};=o9 CH is called differentiable if, for ¢=0, 
. w(t +0) —y(t) 
hay ———_ 
o~0 rc) 
exists. (For t=0, take the limit on the right, 0 | 0.) If p(t ) has the meaning set out in the 


axiom, and if p€D(i), then it can be show n that y(t) is differentiable and solves the ab- 
stract Schrédinger equation 


yO= =, Hy. yO) =yeDO). (1) 


=H) va JAI 


In particular one always has y(t) € D(a). The following converse can also be shown to be 
true: if y¢D(dt), then the differential equation (1) has a unique solution y(t), with p(0) =y. 
For py ¢D(2) the axiom and (1) are equivalent. The axiom is more general, becausc it applies 
to arbitrary y¢H, whereas (1) is more elegant. As the states of physical interest, which we 
shall consider later on, belong to D(3), there is no substantial difference between the ab- 


stract Schrédinger equation (1) and the axiom. 
} 


28.1.3. Stationary States 


Pie ee A quantum mechanical system © = {2, y} is called stationary (or stable) 
if S(t) =S for t=0 in the sense of Axiom 28.1.2. 


Remark 1. According to Axiom 28.1.2 and Remark 28.1.1/2, S is stationary if there exists a 
real-valued function o(t) such that y(t) =ele@®y. This means that the system always remains 
in the same state. Such stationary states are of special interest. They serve, for example, as a 
basis for the quantum mechanical explanation of the existence of atoms and molecules. 


Theorem. A state p of a quantum mechanical system S = {2, p} is stationary if and 
only if wy is an eigenelement of the Hamiltonian operator 3. 


Remark 2 (energy level). Hence y is stationary if and only if there exists a real number £ 
such that y=Fy. E is the corresponding eigenvalue, called energy level. 


Remark 3. The proof of the theorem is easy in one direction. From dy = Fy it follows that 
(Ep_o9—F re) p=y, and hence 
Sr ae 
eye 
according to Theorem 26.4.4/2. Proving the converse is somewhat more difficult. 


. the opposite of a shallow truth is untrue, but 
the opposite of a deep truth is also true... 
(N. Bohr) 

. what greater thing can Man give to Man than 
truth? 
(F. Schiller im his inaugural address as a professor 
of history, Jena, 1789) 


Lo 


ines Interpretations 
28.2.1. Bohr’s Postulate 


The interpretations in the sense of the scheme of Subsee. 28.1.1. for the Hilbert 
space model can in effect be summarized into two rules of interpretation: Bohr’s 
postulate and the statistical interpretation to be described in Subsec, 28.2.2. Let 


‘ 


292 28. Principles of Quantum Mechanics 28.2.8, 


us now consider the following situation: given a quantum mechanical system 
with the Hamiltonian operator 3(, which is in a stationary state p,, that means 
dy, = Eyy,. The system can remain in this state for an arbitrary period of time. 
Later on we shall identify the hitherto considered (mathematical) quantum me- 
chanical systems with actual (physical) ones. Such systems can jump spontane- 
ously from one stationary state y, to another stationary state wo, e.g., by external 
influences such as electromagnetic radiation, where the Hamiltonian operator 
aC remains unchanged. Hence one has 3p: = Eyy9. Physically, this is also inter- 
preted by saying that the system characterized by the Hamiltonian operator 3 
jumps from the stationary energy level E, to the stationary energy level EF. 
When this happens, electromagnetic radiation is absorbed or emitted. Before and 
after this transition the system is in stationary states, i.e., in states that are 
invariable in time. 


Bohr’s postulate (1st rule of interpretation). If is the Hamiltonian operator of a 
quantum mechanical system that jumps spontaneously from the stationary state Yj, 
with ty, =H yy, into the stationary state yo, with Hypo = Expy, then the transition in- 


volves absorption or emission of electromagnetic radiation of the frequency v= 


1 2 
ih \H,—E, 


Remark 1. Here h is again Planck’s quantum of action, as referred to in Subsec. 28.1.2. As 
the Hilbert space H is separable, JC has an at most countably infinite number of eigenvalues. 
Hence there is an at most countably infinite number of sharp frequencies in the sense of the 
postulate. This is the basis for the explanation of the absorption and emission spectra of 
atoms and molecules. 


Ground state, A stationary quantum mechanical system I, y} tends to assume a 
state of minimum energy. 


Remark 2. Hence, the quantities of interest are the lowest eigenvalue # of the Hamiltonian 
operator JC and the corresponding eigenelements y. If the stationary system is undisturbed, 
then it is in such a state y. Electromagnetic radiation of sharp frequencies », in the sense of 
the postulate, excites the system so that it jumps into a state of higher energy, absorbing 
electromagnetic radiation of suitable frequencies. Later on the system spontaneously re- 
turns to lower energy levels, emitting electromagnetic radiation of corresponding frequen- 
cies. 


Remark 3. The above considerations presuppose that 3t has eigenvalues and that there exists 
a lowest eigenvalue. As we shall see later on, many actual quantum mechanical operators 
have this property. 


God does not play at dice 
(Kinstein, who disagreed with the statistical inter- 
pretation of quantum mechanics) 


God casts the die, not the dice 
(cnhaneed literary variant) 
28.2.2. Statistical Interpretation of Quantum Mechanies 


We shall now extend the set of problems to be considered by admitting 
“questions” to be put to a given quantum mechanical system © = {3, y}. Such a 
question shall be connected with a measuring instruction that yields a real num- 


° 
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ber as a measured value. For example, if the (actual physical) quantum mechani- 
cal system of the hydrogen atom is given in a certain state, one may ask for, say, 
the position or momentum of the electron or for the distance between the atomic 
nucleus and the eleetron (naturally such questions already involve some gort of a 
model conception of the hydrogen atom). The problem is to fit such questions 
into the above theory. To this end, every (meaningful) question is assigned a 
self-adjoint operator A, which is also called observable in this connection. The 
method of this assignment will be described later on. Let {Hj}acn, be the spectral 


family of A in the sense of Theorem 26.4.5. By analogy with Subsec. 26.4.2. we 
set 


ye ee 1Or we la, b), —c<u<b=on, 
fp = 2), ie. for B= (a, b), —e=¢a<hbs~, 
i =a — B, 1 Ores = loeeb|h —o<ash<o, 
p= eg — Lo for Bab, —ox~q<p<o. 


where #_24)=0 and E_=E£, 


Statistical interpretation (2rd rule of interpretation). The probability that the 


measurement corresponding to the observable A yields a value that lies tn the interval 
B is || pyll? =e(yp, A, B). 
Remark 1. Tf B=(—-, 7), then E,z=E,. From Subsec. 26.4.1. it follows that h(A) =| yyl|2. 
with ||y||=1, is a distribution function. One has h{co)=1 and 

IB, — Ha) yl? =||Zoyll? — |Zay2=h(b) —A(a) 
for 6 >a. Now it follows from Subsecs. 13.3.4. and 13.3.5. that w(y, A, B) can be uniquely 


continued to become a probability measure on the Borel sets of &,. This justifies the nota- 
tion. 


Remark 2. Thus the question of classical physics “What value will be obtained from the 
measurement?’’ must be replaced in quantum mechanics by the question “What is the 
probability for the value to fall into a certain set?”’. Einstein could never prevail on him- 
self to accept this interpretation, even though he conceded that it is logically consistent and 
very successful. Rather he believed in deeper deterministic relations, the coarsening of 
which leads to statements of the above form. 


Remark 3. If y is an eigenelement of A corresponding to the eigenvalue a, then it follows 
from Theorem 26.4.6. that py =(Ea+. —Eq)y. With B= {a} one then has wy, A,{a}) =|lp|]2=1. 
Thus in the measurement one obtains the value a with probability 1. 


28.2.3. Heisenberg’s Uncertainty Principle 


Definition. Let ©={2, y} be a quantum mechanical system, und let A be an 
observable. 1f pc D(A), then (Ag, p) is called the mean value, and 6(¢, A)= 
=||Ap—(Ag, ¢)@l| the variance, of the measurement corresponding to A. . 

Remark 1. It follows from Subsec. 28.2.2. that A(A) =||£,9|?=w(¢. A, (—~@, 4)) is the distri- 
bution function which generates the probability measure w on R;. Here {Ejhicr, is the spec- 


tral family that belongs to A in the sense of Theorem 26.4.5. On the basis of the considera- 
tions described in Subsec. 26.4.4. it is not very difficult to show that 


(Ap, g)=fAdh(d), 5% p, A)= f\2—-(Ag, —)|2 dA(A) . (1) 
Ry, Ry 


But these are the well-known definitions of probability theory. The magnitude of O(p, A) 
indicates the accuracy with which the mean value (Aq, ~) is assumed. 
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Remark 2. Let A be an operator with a pure point spectrum as defincd in Subsec, 26.4.7. Let 
{Ay }i=1 denote the eigenvalues. Then the total mass of the probability measure w(g, A, B) 
(where B now is a Borel sct in Ry) concentrates at the points A;, Le., w(p, A, py ead 


oo 


Thus one of the points 2; is assumed with probability 1 in a measurement. If p= D) cxpr, 


where {p,};_1 is a complete orthonormal system of eigenelements of the operator A, i.e., 
Ag; =Axrpy, then (1) reduces to 


(Ag, 9) ae 2 AW; Aly {Ar}) = 2, Ar Je,|? ’ 
ca = 


oo 


82g, A) = S Ve— (Aw, 9)? (9, A, (283) = De —(40, #))? le? 


Here > |e,|?=1. 
i 


Remark 3. A measurement is called sharp if 6(g, A) =0. From the definition of the variance 
and from (1) it is easy to observe the following properties: 1. A measurement is sharp if and 
only if m is an eigenelement of A. If « is the corresponding eigenvalue, that means if Ag = 
=p, then (Ag, vy) =". 2. A measurement is sharp if and only if the mean value (Ag, ¢) 
is assumed with probability 1. If A =9{ is the Hamiltonian operator of the system, then it 
follows that the energy of the system can be measured sharply if and only if @ is a stationary 
state (cf. Subsec. 28.2.1.). 


Theorem. Let G={3, p} be a quantum mechanical system. Let A and B be ob- 
servables. If p€D(A)OD(B), Ape D(B), and Bye D(A), then 


d(p, A) 6(y, B)=z |(BAp—ABg, 9)I . (2) 
Remark 4. Heisenberg’s uncertainty principle is deduced from this as a special case: if 


im 8 
(BA —AB) esrrmtee then 
h - 
O(p, A) d(g, B) 25 : (3) 


h hi 
If one substitutes @ for 7 then (3) holds with : instead of me Formula (3) means that a 


relatively sharp measurement with respect to A (i.e., a small value of 6(p, A)) involves a 
relatively unsharp measurement with respect to B (i.e., a high value of d(p, B)). Moreover, 
Remark 3 shows that there are no eigenelements of A or B with the additional properties 
mentioned in the above theorem. 


Conceptions without experience are void; 
experience without conceptions is blind 
(A. Einstein) 


28.3. Quantization 


28.3.1. The Quantization Rule 


To complete the scheme shown in Subsec. 28.1.1., we still have to answer the 
question of how an actual physical problem of quantum mechanies is translated 
into our calculus. For that purpose it is decisive to obtain the corresponding 


ers ; sae 
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Hamiltonian operator 2. Its spectral properties then determine the stationary 
states of associated quantum mechanical systems as well as the absorption and 
emission spectra of electromagnetic radiation. 


Quantization rule. Consider an n-particle system of mechanics and electrodynamics 
with the classical Hamiltonian function HGeie aes ons ty 2) Don), enere yy an 
are Cartesian space co-ordinates and py, .- Pan are momentum co-ordinates. The 
Hamiliontan operator X of the corresponding quantum mechanical system is then 


obtained from the differential expression Ng =f (r +s) P- 
i Ox, 


Remark 1. The procedure goes as follows. At first the classical Hamiltonian function 
Hag, pe) (with k=1, ..., 8n) is formed according to the rules of point mechanics and elec- 
trodynamics. Here (a2, 341,234), K=1, ..., n, is the position of the &-th particle, and 
(P3k—2. P3k—1, p3k) isits Momentum. Thenz;, is replaced by the multiplication operator x,;,and 


ii 
Px by the differential operator ne where fi has the meaning indicated in Subsec. 
. h a 
28.1.2. This gives the differential expression ito= 1 (cr == 5) g. What remains is to 
1 OX;_ 


choose a suitable Hilbert space H, which will in many cases be H =Lo(R3n), and to fix a 
domain of definition D(3t). To get connection with the theory, #f must be self-adjoint. But it 
would even suffice for if to be essentially self-adjoint. In this case we could choose % as the 
Hamiltonian operator. 

Remark 2. The quantization rule is not free of any doubt. On the one hand it necessitates 
several complementary additions, such as the choice of the Hilbert space H and of the do- 
main of definition D(3t). On the other hand, however, even the formation of the differential 


expression { («x . =| is not in all cases unambiguous. If one considers, for example, the 
term 2x,p;, then of course 21p4=p4x;. The quantization of these two expressions yields 
h oa 
xX o ar, cy and 
h oa fi op fh 
7 ol iy) =o Brn a 


which terms are not equal. This is because the translation is done from the commutative 
number domain of real numbers to the non-commutative domain of operators. The physi- 


¢ 


1 
cists behave very wise in such situations, applying the quantization to 5 (xy p4 + p42). 


Remark 3. We have formulated the quantization rule for particles in Ry. For one- or two- 
dimensional problems, of course, a corresponding rule holds. What is decisive is always the 


0 
substitution x, —-z, and p,+>~ —.. 
sub fe k Pk on 


28.3.2. Examples of Quantization 


One-dimensional Motion of a Free Particle: Consider a particle of mass m that 
is free to move in F&, without external forces acting upon it. The classical Hamil- 


u 


tonian function is f(x, p)=3— - Quantization gives 
aim 


hd 1 jh d\2 bh? do 
4 = — = ——_ S SS eee 1 
ns i(s i a eS iF a = 3m de? 0) 


odie 
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Jt suggests itself to choose H = L.(R,) as the Hilbert space and D(d) =CF(R,) as 
the domain of definition. Here, as previously, Cy(,) = D(,) is the class of all 
complex-valued functions with bounded support which are differentiable on RF, 
up to arbitrary order. Thus one observes that in (1) the derivation is handled 
quite formally. 


Harmonie Oscillator: The classical harmonic oscillator has been dealt with in 


m2 ok autor 
Subsec. 12.3.4. The Hamiltonian function is f(x, Oo — x?, Quantization 
: am S 
gives ror 
P h? d2 Poe 
tg= — ae ry. (2) 


2m dx2 ' 2 
Again we choose H = L,(R,) to be the Hilbert space, and D(#)=Cf(R,) to be 
the domain of definition. 


Atoms: Consider an atom whose nucleus is fixed at the origin of the co-ordinate 
system, and let it have x electrons. The position and the momentum co-ordi- 
nates of the 4-th electron are (234-9, 23n—1, 23x) and (934-9, Par —15 Poe); respectively. 
The Hamiltonian function is 


1 3 . n i 1 
fry 5 P)=5- 2 —-Ze > ~+e2 0 —. (3) 
Zui, fo A k=17R ae a 
Here m and « denote, respectively, the mass and the charge of the electron, and 
Z is the atomic charge. Z=n means neutral atoms, whereas Z-+n means charg- 
ed ions. Quantization gives 


> nm 4 il 
Ue cae oe 09 ae |e ere || ; 
atom P 2m + . 4, ae 2 =) : a 
an 92 
Hered = > =. We choose H = In(Ry) as the Hilbert space and D(%atom) = 
rai UA, 


= OF(Rzn) as the domain of definition. We recall that Op (lien) = Dia, is the 
class of all complex-valued functions with bounded support which are differenti- 
able ou Ay, up to arbitrary order. Modification of (4) in an obvious way yields a 
corresponding differential expression for molecules. 


Hydrogen Atom in an Eleetromagnetie Field: Consider the neutral hydrogen 
atom in an external electromagnetic field that is eonstant in time, having the 
electric field intensity G(x) and the magnetic field intensity B(x). Here x= 
= (21, 2, #3) € Ry. According to (24.3.3/3), let B(x) =rot U(x) and E(x) = grad P(x), 
where U(x) =(A y(x), A(x), A3(x)) is the vector potential and (x) is the scalar 
potential. Without loss of generality one can assume that. div (x) =0. Now one 
has to set Z=n=1 in (1) and, moreover, to take into account the elect romag- 
netic field. According to the rules of electrodynamics one obtains 


3 9 


2, (re-£ Aste) —" —e(0 (5) 


2X4, L, X3, Pp, Pr, P3) =—— 
f(y X25 3, Pts Pas Ps) ee) 
as the classical Hamiltonian function. Here it has again been assumed that the 
atomic nucleus is fixed at the origin. (x4, x, 24) is the position, and (D1, Pr» Ps) 
is the momentum of the electron, r = xj + 23 + x3. Further, m is the mass, ¢ is the 
charge of the electron, and c is the velocity of light. By applying the quantiza- 
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Sanam ee ee ees 


tion rule, taking into account that div M(x) =0, one obtains 


h2 fle 3 o é2 2 
aly oz Ag i ca me ‘ 
Om 5 IO 2 Ae) E+ (SES Ieee) @, 


where (x) P= i(%) +A3(x)+A3(x), and the delta operator is defined as 
oy o2 ie 


oe 
Aa ee eG We choose H =L,(R3) as the Hilbert space and Day) = 


=C(R) as the domain of definition. 


Hydrogen Atom in a Weak, Constant Magnetie Field: The following case is of 
special interest : 


se de ily A.(r)=0, 


where A is a constant. One obtains E(x) =0, div M(x) =0, and Biz) (0,0) A: 

Consequently, the field in question is a magnetic field that is constant in space 

and time, its direction being that of the x5-axis. If |A| is small, one has a weak 

magnetic field. In this case it is deemed justified to neglect the annoying term 
2 


Fes |x). This leads to the following set-up for the hydrogen atom in a 


constant, weak magnetic field B(x) =(0, 0, A): 


he Op og\ e&2 
SES 4 
2mei Z (+, oy) oO) 


h2 
zee =—S. Ag 


2m 
where H = L,(R;) is again the associated Hilbert space, and D(3zec) =CF(R,) is 
the domain of definition. 


Remark. For a number of important physical examples we have described differential 
expressions é{y obtained by the quantization rule of Subsec. 28.3.1. In all cases the indicated 
domains of definition D(2) and Hilbert spaces H guarantee that 3 is essentially self-adjoint 
(where in (5) it is necessary to impose physically reasonable additional conditions upon 
A,(z) and P(x)). Hence 3 can be used as the Hamiltonian operator of the respective prob- 
lems. Moreover, the considerations show that, although the quantization rule provides de- 
cisive information as to the structure of the differential operators to be used, suitable modifi- 
cations are applied on a case by case basis. An example of this is zee. Later on we shall 
construct new Hamiltonian operators, where we shall depart even farther from the quanti- 
zation rule. 


28.4.  Single-Particle Problems 


28.4.1. One-Dimensional Motion of a Free Particle 
h2 d2p 

2m dx2’ 
adjoint tn the Hilbert space H = L,( ft). One has Cz=[0, -) and Dz=9. 


Definition. 3 of Theorem 11s the Hamiltonian operator of the quantum mechanical 
system of a free particle of mass m that moves in Jj. 


with D(H) =CF(R,), ts essentially self- 


Theorem 1. The operator Sp = — 


€ 


!) This operator is used to explain the so-called Zeeman effect (cf. Subsec. 28.5.2.). 
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Remark 1. The definition is suggested by Theorem 1 and (28.3.2/1). Cy and De have the 


meaning indicated in Subsees. 26.2.1. and 26.2.2. Theorem 28.1.3 then shows that the free 
particle moving in one dimension has no stationary state, i.e., it is not stable. According to 
Remark 28.2.3/3 this implies that there is no state of this system in which its energy can be 
measured sharply. 


Remark 2. Now let G={%, yw}, be the quantum mechanical system of the free particle 
moving in one dimension, which is in the state y € Lo(.R,). In the sense of Subsec. 28.2.2. we 
ask: “What is the position of the particle?” We have to assign an observable A to this 
question. It suggests itself to resort again to the quantization rule of Subsec. 28.3.1., so that 
the question “Which point x € R, is the position of the particle?” is assigned the observable A, 


Ag=xo(x), D(A) ={@ | p(x) €Lo( Ry), ap(x) € Lo( Ry)} (1) 
Theorem 2. Let ~-=a<b=~, und let y be the state of the free particle moving in 
b : 
one dimension. Then f \p(x)|? dx is the probability that the position of the particle is 
tn the interval (a, b).% 


Remark 3. The theorem is a consequence of the statistical interpretation referred to in Sub- 
sec. 28.2.2. It is relatively easy to show that A of (1) is a self-adjoint operator in Z,(R,). The 
spectral family {#j},¢ rR, corresponding to A in the sense of Subsec. 26.4.5. is ‘ 


(Eap)(x) =4( — =, A)(x) g(a) for pEeL (Ry), 


where ¥(—-, A)(x) is the characteristic function of (— =, A), i.e. y(—, A)(%) =1 fora <A and 
x( — ©, 4)(x) =0 for 24. Hence the proposition of Theorem 2 is easily deduced. 


28.4.2. The Harmonie Oscillator 


h2 dp 
2m da2 
tially self-adjoint in the Hilbert space LR 3 5 v's an operator with a pure point 


i 
Theorem. The operator Xp= +5 29, with D(st)=Ce(R,), is essen- 


: ke 
spectrum. The eigenvalues are hy=h /. : (+5) ,/=0, 1, 2, ... These etgenvalues 
m 2 


are simple, and with a suitable choice of the positive constants c, in Theorem 27.1.3. 
{ 1 


one obtains Hi((km)4 h x) as the corresponding orthonormal eigenfunctions. 


Remark 1. Apart from the physical constants this is a repetition of Theorem 27.1.3. In 
particular, /7;(%) are the Hermite functions of Subsec. 27.1.3. 


Definition. 3¢ of the above theorem is the Ha miltonian operator of the quantum mech- 
anical system of the harmonic oscillator. 


Remark 2. The definition is suggested by (28.3.2/2) and the above theorem. For the classical 

harmonic oscillator, m and & have the meaning indicated in Subsec. 12.3.4. Hence the quan- 

tum mechanical harmonic oscillator has a countably infinite number of stationary states 
i 


-+ — Stem)? Zp 1p? 
x) =Coe . The corre- 


1 il 
HE j((km) * h 2x), The ground state is H)((km) 
sponding energy levels are #1). 


i 
2 ih 


!) A detailed description of mathematical aspects of single-particle problems in quantum 
mechanies is given in [54]. 
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48.4.3. The Relativistic Free Particle in R, 


By analogy with Subsec. 28.4.1., it is possible to deal with a free particle in Rs. 
h2 
Then 3f has the form He = So Ag, where 4 is the three-dimensional Lapla- 


cian operator (delta operator). The dynamics of the corresponding quantum 
mechanical system is then described by the Schrédinger equation of _ ag 
of (28.1.2/1), with p=¢(z, 2), r=(x,, xo, x3). This equation clearly shows that 
our approach chosen so far has been non-relativistic (the Hamiltonian function, 
too, in the quantization rule of Subsce. 28.3.1. is based on non-relativistic mecha- 
nics and electrodynamics). The Lorentz transformations of the special theory of 
relativity described in Chap. 25 mix space and time. Thus it is clear that a 
Schrédinger equation devised to describe a relativistic quantum mechanical 
particle inust contain derivatives of equal order with respect to space and 
time. If one keeps the approved form (28.1.2/1) of the abstract Schrédinger 
equation, then one has to find for 9 a differential expression of first order with 
respect to the space variables. On the other hand, — A (multiplied by suitable 
constants) has proven successful as the Hamiltonian operator for the free non- 
relativistic particle. Combination of both ideas suggests to use operators of the ‘ 


form V — 1. Within the spectral theory of Chap. 26, /— 4 can be given a precise 
meaning. Unfortunately the result is not a differential operator of first order. 
Dirac has shown the way out of this dilemma by the following construction. 
Instead of using L.(R,) as a base space, consider the Hilbert space 


L3(R3) = {f | f= fos fos fi, fe € Lo( Rs)} (1) 
4 
with the scalar product G. 9)12¢R,) = (fis Gk) Lo( Rs g= {91 95 Y3> gz} (The super- 


=i 


script 2 in L3( Rs) is the exponent of 22=4; later on we shall investigate spaces of 
type L3(.R,) with 2” components.) What is desired is a differential expression 3 
of first order in the space variables, the square of which, d?, coincides with (— 4, 
—.4, —A, — A), except for constant factors. Such a result would fully take ac- 
count of the above heuristic remarks. The following considerations show that 
these wishes are realizable. 


Matrices: Tf o =(o%,)f)-, is an N-rowed square matrix with complex elements, 
then o* =(of))p 7-4, With off; =G),, is called the adjoint matrix. o is called self- 
adjoint if o=o*. If one regards matrices as mappings in the complex space Cy, 
then these definitions coincide with the customary concepts of the theory of 
operators. Square N-rowed matrices oj, ..., oy form an anticommutative nor- 
malized system if o,0;= — ojo, for 1+k and if o; is the identity matrix, where 
/=1,..., Mandk=1, ..., M. Here we consider two-rowed and four-rowed ma- 


° . a A 19 5 . 

trices. The four-rowed matrices will also be represented as ey eS) , where the 
: » 29 

a7 are two-rowed matrices. 2p 2 


Lemma. (a) The self-adjoint two-rowed square matrices 


01 i 0 
a=(1 ie i 0 a =1 


_ form an anticommutative normalized system. 
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(b) The self-adjoint four-rowed square matrices 


Otc, : & alee Soi 3 
a=(i,, 0 ), a) OA a) 2) 
form an anticommutative normalized system. Here o4, 62, 63 are the matrices of (2), 


and i=() ‘ and 0=(5 a 


Remark 1. 04, 5, and 3 are the Pauli spin matrices, &4, +, &; are the Dirac spin matrices. 
The Operator 3€P™¢: With a; being the Dirac spin matrices, we set 


te ch 2 o 
Oe ay ae 


+ MC74. . (4) 
1 fen 9 ees 


Here h has the meaning indicated in Subsec. 28.1.2, ¢ is the velocity of light, and 
m is the mass of the particle considered. Further, P= {94(x), Polx), Pa(x), pg(x)}, 


6 AG wee a Cs 
with gx(x)€CF(R3) and x€R;. Finally, a, and oy Bl with =|, ne 
ay ee, : : Ox}: Oxp Ox~, CXR 
a at , have the usual meaning of the multiplication of a matrix by a vector, 

Uh Xe 
where » and Fe, ore regarded as column vectors (column matrices). As desired, 
eh 


d€Pree is a differential expression of first order. Using the above lemma it can 
be shown that 


(a(Pe)2 = {Bp,, Boo, Bos, Boz} 


where By=m2cty—c2h2Ay. This yields us the connection to the above heuristic 
considerations, so that we can attempt to use 3Pit@¢ for the quantum mechanical 
description of the relativistic free particle in Res. 


Theorem. The operator Pi of (4), with the domain of definition 


DEN re {p | a= {P1, Po, P35 Pass pre Cy(Rs)} 
us essentially self-adjoint in the Hilbert space L3(R3). One has 


Ds Dirac=9 and Os Dirac =(—o, — me2|U [mec?, co). (d) 


Definition, 32° is the Hamiltonian operator of the quantum mechanical system of 
a relativistic free particle in Ry. 


Remark 2. The above theorem and the preparatory considerations suggest this definition. 
Jt can be shown that the corresponding abstract Schrédinger equation (28.1.2/1) is in fact 
tailored for the requirements of the special theory of relativity as referred to in Chap. 25. 
Again it follows from Subsec. 28.1.3. that the relativistic free particle, too, has no stationary 
state. 


Remark 3. Before the introduction of 3¢Dita¢ in 1928, physicists were accustomed to the 
fact that all quantum mechanical operators 3f were bounded from below, i.e., Sc[u, ~), 
where y« is a suitable real number. To this there corresponded the semi-boundednesgs of the 
possible energy levels, which was considered reasonable from the physical point of view. 
The unexpected and unintended effect (5) decisively contradicted this conception. It 
induced Dirac to predict the positron, a particle that has a mass equal to that of the elec- 
tron, but the charge + |«|. Later on the positron has been detected by experiment. This leads 
to a conclusion well known among mathematicians and physicists: 


“Formulae are wiser than men’. 
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28.5.1. The Hydrogen Atom without Spin 


Let S9(8,¢) again denote the three-dimensional spherical harmonics of 
Theorem 272 23h and let L..x(r) denote the Laguerre functions of Theorem 27.1.5. 


Theorem. The operator ity, 
h2 


: > : a 
up = = Milo p with D(ky)=CF(R) (1) 


is essentially self-adjoint in the Hilbert space Lo(R3). One has 


me* |= 
—> = [ove] ~~ Byno) ‘ 
On [0, o) and Dd, Dz, =| sah ‘ 2 


me* 


2h2N2 
(multiplicity of the eigenvalue), being spanned by the orthogonal eigenfunctions 


“(fit act) 5 so.v 1 (3) 
pre Sr or 


The ergenraum associated with the eigenvalue — has the dimension N2 


{ ‘ 
— 2me2 


wae [x raw cy=F one = 1 (Gar SYP (9, @). 


Remark 1. @ is the operator Hatom of (28.3.2/4), with Z=n=1, or the operator Illy p of 
(28.3.2/6), in the absence of electromagnetic fields. All the above quantities have the mean- 
ing indicated there. 3{;; corresponds to the neutral hydrogen atom. Thus the theorem sug- 
gests the following definition. 


Definition. 3g is the Hamiltonian operator of the quantum mechanical system of 
the hydrogen atom, with its nucleus being fixed at the origin. 


Remark 2. Later on we shall describe other Hamiltonian operators for the hydrogen atom. 
In all cases the object will be the neutral hydrogen atom with the nucleus fixed at the origin. 
it corresponds to the non-relativistic hydrogen atom without spin. Later on we shall in- 
vestigate the non-relativistic hydrogen atom with spin (Subsec. 28.5.3.) and the relativistic 
hydrogen atom (Subsec. 28.5.4.). The multitude of physically meaningful Hamiltonian 
operators for the hydrogen atom again illustrates that the quantization rule of Subsec. 
28.3.1. is only one tool (though a very useful one) that must be complemented by additional 
considerations which are in part extensive. 


Ground state: From Subsec. 28.1.3. it follows that the hydrogen atom has 

me’ 
 2heW2 
energy levels (Fig. 28.3). According to Subsec. 28.2.1., the ground state is the 
state of minimum energy, L.e., 


infinitely many stationary states, where E.y= are the corresponding 


= 2me? a, _ me 
p(x) =eyr aso ( 3 SH (8, g)=ce ™ where #,=— se (4) 


Here c, and c) must be so chosen that ||yl|z,c2,) = 1. 
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Fig. 28.2 


Speetrum of hydrogen: Now the absorption and the emission spectrum of the 
hydrogen atom can be calculated on the basis of Bohr’s postulate stated in 
Subsec. 28.2.1. The spectrum consists of sharp frequencies. With 

2n-me* 


aioe (Rydberg-constant) 


one obtains the following possible frequencies: 


1 1 re 
Ra =|) No>WN,=1. (5) 
It is customary (and contingent upon historical usage) to combine these possible 
frequencies into series: N,;=1 with N, =2, 3, ... is the Lyman series, N,=2 with 
N,=3, 4,... is the Balmer series etc. Fig. 28.2 is a qualitative representation 
of these series. The frequencies of the Balmer series fall into the range of visible 
ight and can be readily observed by optical means. The experimental values 
are in excellent agreement with these theoretical values. Only high-precision 
measurements reveal deviations, which shall be dealt with later on in the rela- 
tivistic theory. 
5 ee eee ies 


5 ay 


Bohr radius: Bohr stated the radius of the hydrogen atom to be 
h2 
a=—-~0,53-107-8 em. 
me 


This raises the question whether this number can be found in the above calculus. 
To this end we put the “question”, in the sense of Subsec. 28.2.2.: what is 
the distance between the electron and the atomic nucleus? As the nucleus is 
fixed at the origin, this is the question for the distance r of the electron from the 
origin. According to the quantization rule of Subsec. 28.3.1. (and by analogy 
with (28.4.1/1)) we assign to this question the self-adjoint operator A, 


(Ag)(x) =Ial f(z), D(A)={y | 9(x) €Lo( Rs), lal p(x) € Lo(Rs)} , 


in the space L(R;). If the hydrogen atom is in the ground state (zx) of (4), one 
obtains the following result: the probability that the distance of the electron 
from the origin lies between x and 4, where 0 =x SAS, is 
| 22, 
(Ba — Bx) vlligay =e f re © dr. 
“ 
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Here {Eitier, is the spectral family of A. Hence the “most probable” value is 


the maximum of the probability density o(r) =orte *'. But this is r=a (see 
Fig. 28.4). In this way we also find the Bohr radius in the theory described here. 


alr) 


Fig. 28.4 
@ Va 


Quantum numbers: V=1, 2,... in (3) is called principal quantum number, 
L=0,....N—1 is called secondary quantum number, and M=-—J,..., L is 
the magnetic quantum number. Later on we shall return to the meaning of 
these numbers. 


28.5.2. The Zeeman Effect 


The symbols have the same meaning as in the preceding subsection, especially 
the functions fy,z,3,(7) and the Rydberg constant R. Further let 


lel hi 


2me 


UB 


be the Bohr magneton. 
As usual, m and « are the mass and the charge, respectively, of the electron, 
' ‘ : h 
h is Planck’s quantum of action, h=5, and cis the velocity of light. 
Theorem. For every real number A, the operator Ugee of (28.3.2/7), with D(Izec) = 
=C F(R), ws essentially self-adjoint in the Hilbert space Lo( Rs). One has 


= { h Am} 1 
UZee Zee N2 ee oe . ©) 
M=—-N+1,...,.N—-1 


The eigenraum associated with the eigenvalue = uBAM has the dimension 
N—|M| (multiplicity of the eigenvalue), and is spanned by the orthogonal ergen- 
functions {fy,b,a1(")}n=|m1,..,.N—1° 

Remark 1. The theorem and the considerations deseribed in Subsee. 28.3.2. suggest the 
following definition. 


Definition. Hzee is the Hamiltonian operator of the quantum mechanical system 
of a hydrogen atom, with its nucleus being fixed at the origin, in a constant, weak 
magnetic field B(x) = (0, 0, A). 

Rh . 
Remark 2. It follows from (1) that the eigenvalues — +r of the hydrogen atom without a 
magnetie field split up into 2N —1 equidistant eigenvalues when a weak, constant magnetie 
field is switched on. Here the sum of the multiplicities of the splitted eigenvalues of Itz. for 
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TR) 

a fixed N is equal to N2, i.e., equal to the multiplicity of the eigenvalue — We of Itz (the 
LI es : ; , 

eigenrawm of — for tq is decomposed; Fig. 28.5). With a suitable choice of A it may 


well happen that 


Rl 


Rh a 
~ Hy? 7H Bday = ye eA My 
Ny 9 


for V,+N5. These eigenvalues then coincide “accidentally”, and the theorem has to be 
modified in an obvious way. 


ae 
> 
Sl 
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Spectrum and seleetion rules. By analogy with (28.5.1/5), now the values 


1 1 UB 9 
R 52 xi)+ h A(M ,— M,)>0 (2) 
are possible frequencies of absorbed or emitted electromagnetic waves. Here 
Ni=1,2, ... and N,=1,2, .. -, while M,=—N,+1, ..., Ny-1 and i= 
= —N,+1,..., Ny—1. However, it turns out that only a few of these possible 
frequencies are in fact realized. This is due to the selection rules. One may put 
the question of what is the probability for the system to jump from the (nor- 
malized) state fy,1,,u,(%) into the (normalized) state [x,,In,ar,(%). We shall 
not discuss this theory here. It yields the selection rules (for dipole radiation) : 
the transition from fy,,t,,1,(x) to fy,,1,,11,(7) has a positive probability only if 
[Z;—Lo|=1 and |M,—M,|=1. With these qualifications, the frequencies (2) 
are in good agreement with experimental findings. 


Quantum nunibers. It is now clear why M was called “magnetic quantum num- 
ber”: it is responsible for the splitting of the spectrum upon switching on a 
constant magnetic field. 


28.5.3. The Hydrogen Atom with Spin 


Anomalous Zeeman effeet. The frequencies (28.5.2/2) (normal Zeeman effect) 
have been verified by experiment (taking into account the selection rules). 
There are, however, additional splittings which resemble those of (28.5.2/2) but 
are not covered by this formula: this is the anomalous Zeeman effect. Again 
one may attempt to explain these frequencies in the sense of Bohr’s postulate. 
What is desired is a Hamiltonian operator fitting the problem, which has suitable 
eigenvalues and deviates as little as possible (and in a physically plausible 


manner) from the proven operators 3 and Stee. The experimental data of the 
anomalous Zeeman effect suggest doubling the spectrum of 3€y (more precisely: 


QSSa 
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the dinension of the eigenraum associated with the eigenvalue ohh should 
N2 


be 2N2 instead of N2), which then splits up when a weak, constant magnetic 
field B(x) =(0, 0, A) is applied. Quantitatively, the eigenvalue —Rh (which is 
now twofold, having been simple before) should split up into —Rh+y,A 
and Rb — «pA. The other eigenvalues should behave correspondingly. Mathe- 
matically, such a repair is easy to accomplish. By analogy with (28.4.3/1), we 
consider the Hilbert space , 


Ci vi, fr€Lo( R3)} (1) 
with the scalar product | 
Vis Dison =Vie Gilg) + Us Glin, 9=( 92}. 


The spectrum of the operator JCy is doubled by considering the self-adjoint 


operator (lag,, Xaqy} in Lz (Ry). An analogous approach is chosen for Wzec- 
The desired (and necessary) correction corresponding to the anomalous Zeeman 
effect can now be attained as follows. All the symbols used have the meaning 


Boe : aa 7: é 
indicated previously. Derivatives 4g and uae with p={y,, go}, are defined as 
in Subsec. 28.4.3. Oxy 
Theorem. The operator 132i, 
5C5Rg = ees { = aA ls eo ce ey pAt 2 
Zee tg es ahh ar mee? as) ee —%F} (2) 


with DARE) ={p | ~={o,, Go}, eC CT(Rs)} ts essentially self-adjoint in the Hil- 
bert space L4(R3). One has 


Dae = = me 


apm Page| — ya aA band o 


IS SW Proc 
Ai=—N-+1,....N—1 


Remark 1. The modification made to the doubled operator ze. of Subsec. 28.5.2. is the 
“spin term” Sp=p eA {y;, —Qo}. This operator is very simple. If shifts the speetruin of Iz65 
by «2A to the right in one component and to the left in the other. This is just the desired 
effeet. Moreover it is observed that the eigenraums associated with the respective eigenvalues 
are spanned by the orthogonal elements 


fw.t,u(®) t={fw,n,a(%), 0}. fv z,ar(z) ) ={0, fiv,z,a1(2)} (4) 
N, £, M have the previous meaning. (We dispense with the elementary but somewhat 
lengthy procedure of clarifying the correspondences between the respective eigenvalues and 
eigenfunctions.) The preceding considerations as well as the requirements described above 


now suggest the following definition. 
Definition. 332" 7s the Hamiltonian operator of the quantum mechanical system 
of a hydrogen atom, with tts nucleus fixed at the origin, in a constant, weak magnetic 


field B(x) =(0, 0, A), where the effect of spin has been taken into account. 


Remark 2. In the physical interpretation, an additional degrec of freedom of the electron, 
called spin, is introduced to explain the anomalous Zeeman effect. The spin has only two 
possible states (orientations), usually denoted by t and , . They correspond to the elements 
(4) in our caleulus. The spin is interpreted as an intrinsie magnetic inoment of the electron 
that can assume the valucs seg and —s:p, responding accordingly to an external magnetic 


field. 
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Spectrum. By analogy with (28.5.1/5) and (28.5.2/2), now 


(2 = a +e A(M,—My,)>0 and 
a 


Sik 1 LR 5 
a -5z)+ h A(M ,— My+2)>0 
are the possible frequencies of absorbed or emitted electromagnetic waves. 
Again V,=1,2,... and N,—1, 2, ..., while M,=—N,+1,...,N,-1 and M,= 
= —N»o+1,..., N)—1. These possibilities are again restricted by the selection 
rules, but give a good description of the normal and the anomalous Zeeman 
effect otherwise. 


Perturbations. Generally the eigenvalues of 3, ores and aeepin are degenerate 


(i.e., not simple). According to our calculus described so far, every normalized 
eigenelement of an eigenvalue is a stationary state. But physical reality can only 
be attributed to such stationary states that undergo only slight variations as a 
result of small perturbations of the corresponding Hamiltonian operator. If under 
such a (physically reasonable) perturbation a degenerate eigenvalue is split up 
into a multitude of simple eigenvalues, then physical relevance can be ascribed 
only to those eigenelements of the unperturbed Hamiltonian operator which 
result from the eigenelements of the perturbed operator as the perturbation 
approaches zero. What is then left of the originally A-dimensional eigenraum 
are only A orthonormal eigenelements of physical relevance. If Gee is modified 
by 
ey +h(r)p+6A{,— Go}, 
9 

then h(r) can be regarded as a (physically reasonable) perturbation of =, and 6: 


as a perturbation of wz. With a suitable choice of h(r) and 4, the &-degenerate 
eigenvalues of (3) then split up into K simple eigenvalues. Further it turns out 
that the orthonormal eigenfunctions of physical relevance agree with the sys- 
tem (4). 


Quantum numbers. The result is as follows: the system (4) describes the 
physically meaningful stationary states of the hydrogen atom (with the nucleus 
fixed at the origin) in the constant, weak magnetic field B(x) =(0, 0, A), where 
the spin has been taken into account. Here the following notation has been used 
for the quantum numbers: 


principal quantum number N=1, 2, ..., 
secondary quantum number L=0, 1, ..., N— iL. 
magnetic quantum number M= —L, ..., 0, ..., lb, 
spin quantum number S=t, |. 
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28.5.4. The Relativistie Hydrogen Atom 


The operator 3(”'"*° has the same meaning as in Subsec. 28.4.3. Further let 
ee 


ac wee be the Sommerfeld fine structure constant. It is physically dimen- 


sionless. (Originally it had been assumed that « is peecily equal to = This 
fl : 
is the reason why it is written « ~ Tay even today. 


Theorem. The operator ICP, 


Mieratleg Se == P, DO) pe. IDG : (1) 
is essentially self-adjoint in the Hilbert space L3(R3). One has 
CzDirac =(—, —me2]J [rmec?, «) (2) 
and 
Dvirac = aa dD. ;Dirac ~ = {Exp} PS = I 2c 
XH ieee 
where 
{ 
oe 2) 
Ee ge Sc (1 -—_.| . (3) 
(n+ Vke— 22)? 


1 
Remark 1. The proof exhibits a peculiarity: it uses the fact that C5: That is, the know- 


ledge of the numerical values of physical constants is required. The point spectrum D_ 


yDirac 
accumulates from the left at the point mc? (Fig. 28.6). “7 


Remark 2. The operator 3}"*° is derived from the operator 3fPi"@¢ of the relativistic free 
particle in #; in the same way as the operator 3(y of Subsec. 28.5.1. is derived from the 


Re 
operator ao A of the non-relativistic free particle in R3. The above theorem then sug- 
n 
gests the following definition. 
E ‘ 
a 2 a es 
ine* me? 


Definition. 32° as the Hamiltonian operator of the quantum mechanical system 
of the relativistic hydrogen atom with its nucleus fied at the origin. 


Remark 3. If one considers the relativistic hydrogen atom in a constant, weak magnetic 
field, it turns out that the above formulation automatically covers the spin in the sense of 
Subsec. 28.5.3. 


Spectrum. Now, again following Bohr’s postulate of Subsec. 28.2.1., the fre- 
quencies of absorbed and emitted electromagnetic waves can be stated. It is 
of interest to compare them with the frequencies (28.5. 1/5) of the non-relativis- 
tie hydrogen atom. Expansion of (3) with respect to « gives 


me‘ a 3. oe 
B = ee | oa, Sle ooo || 3) (4) 
Be 2h2(n+ by? | r@+kh) 4 atk | 
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where the remainder contains terms with «‘, «6, ... Since nothing but the differ- 
ences of Hy; are of interest, the term mc? has no effect. Setting N=n+k and 
comparing this with (28.5.1/2), one observes that the relativistic correction is 
less than «2102 "/,, i.e., less than 10-2 9. A minutedifference, which can, however, 
be detected by high-precision measurements. It is also seen that the non-relati- 
me’ 

2h2(n +h)? 
that are very close to one another. A corresponding splitting then occurs with 
the frequencies in the sense of Bohr’s postulate: this is the fine structure of the 
hydrogen atom. These minute splittings are in good agreement with experimen- 
tal findings. 


vistic eigenvalue splits up into different relativistic eigenvalues 


28.6. Atoms and the Periodie System of the Elements 


28.6.1. Atoms without Spin 


Theorem. The operator Xatom of (28.3.2/4), with DOG ton) == Ce eas essentially 
self-adjoint in the Hilbert space Ly(R3n). 


Remark 1. From the considerations in Subsec. 28.3.2. it follows that datom Might be suitable 
to describe the ionized atom whose nucleus is fixed at the origin. The theorem suggests the 
following definition. 


Definition. atom ts the Hamiltonian operator of the quantum mechanical system 
of an atom (without spin and without Pauli’s principle being considered) with the 
atomic charge Z and n electrons, the atomic nucleus being fixed at the origin. 


Speetrum: In contrast to the hydrogen atom, the spectrum of dipom CammOt an 
general bé determined explicitly. There is, however, a lot of qualitative infor- 
mation. For neutral atoms, i.e., Z=n, the following is known. There is a number 


# such that Cz , =[#, ce); on the left of « there exist infinitely many eigen- 
“atom 


values of finite multiplicities, which accumulate at ju (Fig. 28.7). Further, in 
[u, oc) there exist infinitely many eigenvalues, too, with an infinite number of 
accumulation points. The physically interesting part of the spectrum, which 
hes in (—e, uJ, is hydrogen-like. 


Hh tH ttt Pig. 28.7 
ya 


Remark 2. The investigations on the hydrogen atom show that a cannot yet be the last 
word. The spin, which we had found an cssential new aspect in the case of the hydrogen 
atom, has not been taken into account. Moreover we must try to introduce the so-called 
Pauli principle into our axiomatics, which will lead to a deeper understanding of the periodic 
system of the elements. 


28.6.2. The Space Lz4(Rsn) 


We shall now develop the necessary mathematical tools for our further 
considerations. 


The space L3( Ren). Let z= (x9n—2, X9n—1, COW c fia) Wit he ee By analogy 
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with (28.4.3/1), let L3(R3,) be the Hilbert space 
{f | f={folz, OnE; Sah Aan DU} Xn), ones ee Ce ery Xn)}, fr€ Ly(Ryn)} 
with the scalar product 


Qt] 


(f, Di Ran) a (fes Gk )Ly(Boyn) 


we rerg — 409.225 Yyn_, 3 € L3( Kon). 


The group S,. As usual, G,, is the group of the x! different permutations of the 
natural numbers from 1 to 2, ice., 


QA, 05 R)=(G1, 5 Qn) and (py, «5 Da) = (Qp4> ++) Uy) « 


The group multiplication is defined as successive application; q-1, with 
Q (G4) +++) In) = (1, ..., 2), is the transformation inverse to qg, the identical trans- 
formation is the identity element. Finally, let the sign (— 1) be determined by 


eres 2?) 
IT (gaat) I Ee (Yay. — Yay) « 
If (—1)"=1, then q is called an even permutation, and an odd one if (—1)¢=—1. 


The operation qf. Every integer 7, 0=j=2"—1, has a unique representation as 
an n-digit binary number, j= 2)... a, Where a, is either 0 or 1. If qe Gn, 
then we set 

Aj\=2q, 009 Ciih 
if j=) ... a in the binary representation. If f¢ L3(R3,), then 

GI = {fol Bay» ++» Xan)» => fil Zaps +> Fay)s oor fon_ 4 (Bays os %q,)} (1) 
generates a unitary operator in L3(Rs,) where f; stands on the place gfj]. In 


(1) it is not only the co-ordinates Z,,...,%, that are permuted, but also the 
places of the elements f;. One has g[0]=0 and g[2"—1] =2" —1. 


Definition. 
L3. a(n) = {fF | fELS( Ron) where af=(—1)%f for all qeGn}. 

Remark 1. Thus Ey A( Bn) is defined to contain all those elements of L5( #»,) which are anti- 
symmetric under the transformation / -q/. Correspondingly, it is possible to define a space 
L5\s( Rp) that consists of all elements of L5(R3,) with gf =f for all ¢éS,. This space is of 

8 5 n 
physical interest, too (like the space Lz 4(fz,)). 
Theorem. (a) L3 4(M3n) ts an infinite-dimensional closed subspace of L3(Rs,) 
(and hence a Hilbert space). The corresponding projection operator P is 


ut a4 
1 fea > (— 1)‘qf for ye L3( Ray) : 
Ms ge En 
(b) Lf hy(y), --s hel(Y), with Y=(Yps Yo, Ys), WS an orthonormal system in L( Rs), 
and if f= {0, ...,0, hy, (%4) ++ hi, (Xn), O, «+, 0}, then Pf=0 of and only rf there exist 
j-th place a a 
at least two indices rand s, with |1Sr<s=n, such that hy,(7/) =hy(Y) and a, =a. 


Here j=%, ... a 18 the binary representation of j. 
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Remark 2, Part (b) looks relatively harmless. The proof, however, is not so simple. Hidden 
behind this formulation is the Panli principle. Elements / of the above form with P/+0 will 
be of interest later on. For a given & it will then be a combinatorial problem to insert suitable 
products of the functions /,(¥) at suitable places, 


28.6.3. Atoms with Spin 


Theorem. The operator Ham", 
ae! li h2 Z n 4 ; 1 ; 
0 i _ 7.9 i 5 ot 
dlatom P = + oe pune Ped Pp (1) 
with | 
Dein) = {9 | P= (PQ. +5 Pyn_ FELT a( Ban), GE CH(Rin)} (2) 


us essentially self-adjoint in the Hilbert space L3, 4( Ryn). 


Spin and Pauli principle (first version). The transition from Slatom of (28.3.2/4) 
and Theorem 28.6.1 to 3¢7@3h can be described as consisting of two steps. First 
we substitute 13(Ry,) for L5(R3,). Physically this corresponds to taking into 
account the spin. According to Subsec. 28.5.3. the spin is interpreted as a 
property of the electron, where the spin quantum number S has the possibilities 
t and }. In this interpretation, » electrons permit 2” spin configurations. 
Setting t=0 and |=1, one may interpret a spin configuration +4 | | ... T= 
as a binary representation of j and assign the place 7 in {q, ..., Pon_,}to this 
configuration. Thus each of the 2” components L(g) of L3(R3n) represents 
a certain spin configuration of the » electrons. To the transition from L»(R4p) 
to L3(A3,) thus there corresponds the transition from atoms without spin to 
atoms with spin. However, it turns out that L3(R5,) is too large as a base space: 
many of its eigenfunctions do not allow a physical interpretation and contradict 
the Pauli principle. Roughly speaking, this principle says that, if L3(Ro,) in- 
stead of Lo(R»,) is taken as the base space, only those eigenelements of Jlxtom 
which belong to L% 4( Ry») are of physical relevance. The theorem is adapted to 
this situation. The fact that (1) applies to L5 4( Bsn) follows from the commuta- 
tivity of Iii and P of Theorem 28.6.2, which is formally written as ee en 
= Pitiau If one uses the above interpretation of the place 7=| | +...) asa 
characterization of a certain spin configuration, then it is now possible to give 
a physical interpretation of the operation qf of (28.6.2/1). qé¢G, permutes the 
places w,, ..., %, of the n electrons, &q,—> ©), -., Xq, > Xn. Lf the x electrons have a 
spin configuration +4... t=j, then under this permutation the places of the 
spin configuration are carried over without being changed; the function on the 
place j=«,... %, is then shifted (under the permutation Xq.> tp) to the place 
NI =%q, +++ %q,+ Now the above, rough first version of the Pauli principle can 
be reformulated as follows: I(iijnm admits only those stationary states g which 
are antisymmetric under a permutation of the x electrons with preservation of 
the spin configuration, that is, q@=(—1)%p. In the next section we shall make 
this more precise. The theorem as well as the statements made above now 
suggest the following definition. 


Oo OO opPauli - a 6 6 
Definition. 3720 is the Hamiltonian operator of the quantum mechanical system 
of an atom (where the spin as well as the Pauli principle have been taken into 


29 Fi 5. iG 

28.6.4. 28.6. Atoms and the Periodic System coll il 
a ee Oysiem _odd 
account) with atomic charge Z and n electrons, the atomic nucleus of which is 
fixed at the origin. 


Spectrum. For neutral atoms, Le., Z=n, the interesting part of the spectrum 


FpPauli ; . ae . : 
of Hafom iS again hy drogen-like: there exists a real number y such that Czpauli = 
ae ft . ies: , atom 
=[¥, --), and on the left of » there exist infinitely many eigenvalues of finite 


multiplicities, which accumulate at » (Fig. 28.8). However, other than in the 
case of the hydrogen atom there are also eigenvalues in [», <0) (for »=2). In 
particular there exist infinitely many stationary states (what a luck, for other- 
wise the atoms would not know why they are immortal). There is a lowest 
eigenvalue, and hence there are also ground states. 


; Fig. 28.8 
v 


28.6.4. The Pauli Principle 
The entities we are interested to know are the lowest eigenvalue of J(F¥" as 
well as the associated eigenraum. Unfortunately there is no chance of an explicit 


analytical determination. Following the best physical custom, it is attempted to 


replace J¢Pauli by a simpler operator, hoping that the simpler operator will 


approximately show the same (qualitative and quantitative) behaviour as 


7022 What is annoying in (28.6.3/1) are the electron-electron interaction 


1 , : 2 ; : 

terms e2 >’ -— ¢. Onetriesto replace this term by >) V(rz) g. A suitable choice of 
l>k TLE 1 

V will also be effective in the sense set out above. The operator modified in this 

way is much simpler, then x electrons are decoupled, and the operator is in effect 

separated into n operators of the hydrogen type. If the approximate quanti- 

tative correctness of the description is dispensed with, one may set V(r) =0. 


This then gives the operator 32224, 
° : h2 n 
(Pauli es Wg Ae. oo = : il 
toms a OP in? (1) 
DGG ier) = DOs) , 2 


which is again considered in the Hilbert space Li 4(/t3n). It is hoped (not in 
o j . . . . 
vain, as the success shows) that 3(P2"" gives a qualitatively correct description 


of the spectrum of 3228", but at least of the lowest eigenvalue and of the 


atom? 
e jo¢ . 
structure of the associated cigenraum. Apart from Z, 3724 is separated into x 


Hamiltonian operators of the hydrogen type according to (28.5.1/1) in each of 

the 2” components of p={¢y, +) Yon_,}. If in (28.5.1/1) e? is replaced by Ze?, 
; : RhZ2 |, ; 

then the corresponding operator has the eigenvalues — We? N= lp 2ee 


where R is again the Rydberg constant. Correspondingly, one has to replace 
e? by Ze2 in (28.5.1/3). We shall, however, keep the notation fy,z,a/(x) for the 
function modified in this manner. Now the eigenvalues and eigenfunctions of 


RhZ? 
° . = * my 
a¢F2u" can be constructed of a oye and fy,1,(7), where it must, however, be 


taken into consideration that the base space is L3 4(2sp) (and not L3( Ryn). 


20% 
ee 
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Theorem. The operator HEA) stated in (1), with the domain of definition D(3CE28) = 


=D(Hiem) according to} (28.6.3/2), as essentially self-adjoint in the Hilbert space 


atom 
L353. 4( Ryn). If N(n) ts @ non-negative integer such that 
N@) Nin) +1 — XQ) 
De, A NE ea (2) 
N=0 X=0 N=0 
r(n) 


then E= —2RhZ2N(n) —RhZ? = 78 the lowest eigenvalue of the closure of 


; (W(x) + 1)? 
shaun The associated eigenraum is spanned by 


n 
Pir, 0; TT fy,,t,00, (21), OOte (3) 
f=1 j-th place 
Here the quadruples (Ni, ly, Miu), t=1,...,n, are patrwise different, where 
JH%1 0 Om ws the binary representation of 7=0, ...,2”—1. Further, N;= N(n) +1, 
where exactly r(n) of the numbers N; are equal to N(n)+1, and L,=0, ..., Ny—1as 
QO? Wi = a rl wire 


Remark 1. If (1) would be considered in PECANS Y then —RhZ2n would be the lowest eigen- 

value, and the associated eigenraum would then be spanned by the elements 40, ..., 0, 
n 

dl ioe), Oe. of. Due to the restriction to LY 4( Ran) the lowest eigenvalue H increases 

t=1 


substantially. The decisive effect is due to (3) with the indicated qualifications. This is a 
relatively simple reformulation of Theorem 28.6.2(b). In particular it is seen that the ele- 
ments of (3) are not zero. 


Definition. The ground state of an atom with the atomic charge Z and the nucleus 
fixed at the origin is qualitatively described by the normalized element (3), where 
the quadruples (Ni, Li, Mi, x1) satisfy the above conditions and [es Ln =7 Charac- 
terizes the spin configuration of the n electrons. 


Remark 2. In Subsec. 28.5.3. we had found by perturbation considerations that for a fixed 
N physical relevance can be attributed to the functions fw,c,m(x) alone (and not to their 
linear combinations). Now we also stick to this statement, as is shown by (3) and the above 
definition. The magnetic quantum number would, however, become important not until 
magnetic fields are applied. 


Pauli principle: According to Theorem 28.6.2(b), (3) is different from zero (and 
hence, after being normalized, describes a stationary state) if and only if the 
n quadruples (N;, L;, Mi, «:) are pairwise different. This is the usual wording 
of the Pauli principle. If the additional conditions stated at the end of the 
above theorem are satisfied, then a state of minimum energy 1s realized, i.e. the 
system is in the ground state. Higher energy levels represent excited states. 
According to Bohr’s postulate, the jump transitions between these states yield 
possible frequencies of absorbed or emitted electromagnetic waves. For n=1 
(hydrogen atom) the Pauli principle is not effective; in this case one has the 
system (28.5.3/4). 


Aufbauprineiple: The following interpretation of (3) is customary in physies. 
Each of the n electrons is characterized by a quadruple of 4 quantum numbers 
(Ni, Li, Mi, u), where N, L, Mand «=S have the same meaning as at the end 
of Subsec. 28.5.3. The state of the atom is then built up from the “states” of 
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the electrons in the sense of formula (3). To what extent such an aufbauprinciple, 
which neglects electron-clectron interaction, yields a satisfactory description, 
must be shown by comparison with the experiment. 


28.6.5. Periodic System of the Elements 


Definition. Two (neutral) atoms with the atomic charges n,; and ny (number of 
electrons) are chemically related if r(n 1) = 72) tn (28.6.4/2) 


Periodic system. According to this definition, the system of chemical elements 
exhibits periods of length 2N2, where N =1, 2,3,.... The first three periods 
then include 2, 8, and 18 elements. The first period includes the elements H 
(hydrogen) and He (helium). According to Def. 28.6.4, functions fy,z,1(2) 
with V =1 will suffice for the construction of the ground state. If n=3 (lithium, 
Li), then a function fy,z,1¢(@) with N =2 is required in addition. This period 
covers the atomic charges up to n=10 (neon, Ne). The next period starts from 
n=11, that is sodium, Na. Chemically related elements are H, Li, Na with r(n) = 1, 
and He, Ne with r(n)=0. 


Shell model. The aufbauprinciple of Subsec. 28.6.4 now leads to the shell model. 
The individual shells are determined by the principal quantum number JN, 
and the shell V accommodates a maximum of 2? electrons. The atom provides 
for as favourable an energetic arrangement of its n electrons as possible. To this 
tendency there corresponds a progressive filling-up of the shells from the in- 
terior to the exterior ones. Then the outermost, that is, the (N(n)+1)-st shell 
is occupied by r(7) electrons that are responsible for the chemical behaviour of 
the atom. 


Perturbations. The qualitative approximation according to Def. 28.6.4 becomes 
increasingly questionable as n increases. Moreover, following the lines of the 
perturbation considerations of Subsec. 28.5.3., one would expect that, in addition 
to N, the secondary quantum number L will also influence the magnitude of the 
eigenvalue. On the other hand, the magnetic quantum number M has no effect. 
In a comparison of the above considerations with experimental data, the first 
deviation is found to occur for n=19. For larger values of x these deviations 
increase more and more. What is more important than these deviations, however, 
is the remarkable fact that it is possible to give an explanation of the periodic 
system of elements on the basis of quantum mechanical reasoning. 
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29. Geometry on Manifolds I (Tensors) 


ele Manifolds 


29.1.1. The Paracompact Hausdorff Space 


Topology. A system % of subsets of a given set M is called a topology in M 
if 8 has the following three properties. 

1.0¢% and M€ (as usual, the void set is considered to be a subset of every set). 
2. Finite intersections of sets in 8 belong to B. 

3. Finite or infinite unions of sets in B belong to B. 

The system $ with the above properties is also called a system of open sets. 
Be is called a neighbourhood of a point x of M if xé B. 


Hausdorff space. A set M with a topology % is called a Hausdorff space if for 
every pair of different points #¢ 4 and y¢ M there exist sets B,€% and 16 
such that B,1 B,=0 (Fig. 29.1). That is, 8 contains enough elements to be 
used for a separation of the points of M. 


Fig. 29.1 


Compact sets. A set CO in a Hausdorff space M with the topology % is called 
compact if in every covering of C by a (not necessarily countable) number of 
open sets it is possible to find a finite number of sets that suffice to cover OC. 


In other words, U B;>C with B,¢% implies that there exist sets B,, such that 
WV iel . 


Closure. A is called a closed set in the Hausdorff space M with the topology 8 
if there exists a set B¢Y such that A= M\B. Let 9% be the system of closed sets 
in M. If C is an arbitrary set in M, then C=(D, where the intersection is 
taken over all DEQ with DDC. C is a closed set, called the closure of C. 
Definition. Let M be a Hausdorff space with the topology %. 

(a) A covering {Uj}icr CB of M, that is U U;=M, is called a refinement of a 

; iel 

covering {Vileex B® of M if for every 1€L there exists a k=kh()CK such that 
U; Ss Woes 

(b) A covering {Up ier cB of M ts called locally finite if for every x€ M there 
exists a neighbourhood V, such that U;VV, is nonvoid for only finitely many 
parameter values «El. 

(¢) M is called paracompact if for every covering {VikeexcB of M there exists 
a locally finite covering {U pier CB that isa refinement of {Vive x: 

(d) M ws called locally compact if for every x«€M there exists a BE such that 
cB and Bis compact. 


99.7.9 g . ‘ 
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Remark 1. The 2,, is a paraeompaet. Hausdorff space, provided that the usual open sects are 
chosen as a topology. This is also true for arbitvary sets in #,, that are open in the usual 
sense. 

Remark 2. From now on. the term “eovering” shall always be understood to mean a ‘cov- 
ering by open sets”. Further, from now on we shall exclusively use finite or countably in- 
finite coverings. It ean be shown that this does not involve a loss of generality, provided that 
the Hausdorff space is connected. Here, the set BEY (and espeeially also J/) is said to be 


connected if B cannot be represented as a union B= B,U B, of two nonvoid open disjoint 
sets. 


29.1.2. C--Manifolds 


From now on we shall always assume that (M, $8) isa paracompact connected 
Hausdorff space with the topology %. 


Induced topology. If BEB is connected, then (B, Bz), with B,={C | Ce, 
Cc B} is again a paracompact connected Hausdorff space. %» is called the 
topology induced by %. 


Topological homeomorphism. Let (4f,, 8,) and (Ms, %3) be two Hausdorff 
spaces. A topological homeomorphism is a one-to-one mapping F of M, onto My, 
such that F(%\)=Wy. (That is, the system of open sets 8, is mapped one-to-one 
onto the system of open sets $5.) 


Dimension. A Hausdorff space (M@, 8) has the dimension x if for every x¢M 
there exists a (connected) neighbourhood V’, that can be mapped homeomorphi- 
cally (with the use of the induced topology) onto a (connected) open set in 
R,, (Fig. 29.2). (Open sets in #, shall always have the usual meaning.) 


Atlas. Let M= U !, bea locally finite covering of the n-dimensional Hausdorff 
k=! 

space (M, $). If , is a homeomorphic mapping of V, onto an open set in FR, 

then {V3 Fy}g_, 18 called atlas. MW is then described by local co-ordinates (or 


local charts). 

Eyuivalence elasses of atlases (Fig. 29.3). If {J’,; F,} is an atlas of (M, ¥), then 
let F.(V,)=U0,.c0 R,. If VN V,+0, then F,F;1 is a one-to-one mapping of 
F.AV,0%,) onto FAV,NV,. From now on we shall always require that all 
of these mappings be differentiable up to arbitrary order, the term “atlas” shall 
be understood to imply this property. Two atlases {V,: F,} and {W,; G,} are 


Tote, HS) 
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called compatible if {V,, W,; /;, Gjgi-, is an atlas (this is a demand made 
upon the above properties of differentiability), The term ‘“‘compatible“ is an 
equivalence relation, and hence it is possible to form equivalence classes of 
compatible atlases in the class of all atlases of a given n-dimensional Hausdorff 
space. 

Orientation. An atlas {V,; F,} is called orientable if the Jacobian determinants 
of all the above mappings F/F;' of F,(V,0V;) onto F,(V;,NV)) are positive. 


Definition. An n-dimensional orientable C»-manifold consists of an n-dimensional 
paracompact connected Hausdorff space and an equivalence class of atlases that 
contains at least one orientable atlas. 


Remark 1. The requirement that the space be connected is not necessary, but convenient. 


Remark 2, Examples of orientable C°-manifolds are connected open domains in f,, but also 
smooth surfaces like those dealt with in Subsec. 8.1.5. An example of a non-orientable C~- 
manifold is the well-known Mébius strip. From now on we shall only consider orientable 
manifolds and orientable atlases, so that we need not mention this explicitly. 


Remark 3. Here we restrict ourselves to C'~-manifolds. In the same way it would be pos- 
sible to investigate C*-manifolds and analytical manifolds. 


Diffeomorphie mappings (Fig. 29.4). Let (J7,, 8,) and (M,, 85) be two n-dimen- 
sional orientable C~-manifolds. Let {V;.,; Fy.,} and {Vi.2; Pi} be two asso- 
ciated atlases. A topological homeomorphism (homeomorphic mapping) ® of 
M, onto My is called diffeomorphism if for all £ and 1, F,.®F 7} is a C°-mapping 
of 

Bey Fa D-1V 79) onto Fis (D Ve a Vi) 


Fig. 29.4 
and if, for all k and /, F, ,®-1F 75! is a C--mapping of 
Fy (P View N Vio) onto Fis Vat) (OS Nea) 


Here we naturally assume that Vig N®-!V12+90. The definition is meaningful 
because it is independent of the selection of atlases. 


29.1.3. Funetions on C--Manifolds 


If fis a real function on an n-dimensional C»-manifold M (that is, a mapping 
that assigns to every point of M a real number) then it is possible to transfer 
pto hs i {V,, Fe, is an atlas, then let (2) =/F pe evith eu, The 
class of all functions f for which the functions f,(x) are differentiable on U;, up to 
arbitrary order, where k=1,2,..., is denoted by C~(M). This definition does 


BY201. 29.2. Geometric Objects olla 


ee 


not depend on the choice of the atlas. Let Co(M) be the class of the functions 
p€C=(M) with a compact support, supp gy, where supp ¢ is the closure of 
{p | pe€M, ¢(p) +0} (cf. Def. 14.6.4/2). 


Lemma (partition of unity). If M is an n-dimensional C=-manifold, and if 
{Vi3 Fyhee, 1s an associated atlas, then there exist functions p,€CF(M), 0S 9, = 1, 


such that > ¢:(p)=1 for all pe M and supp 9, CV, for k=1, 2, ... fora suitable 
k=1 
choice of =k). 


Remark. This lemma can be reduced to Lemma 22.1.4. 


29.2. Geometric Objects 
29.2.1. Fibre Bundles 


Over every point y of the n-dimensional C=-manifold M we erect the fibre 
N=R,, where R, is again the k-dimensional Euclidean space. (For the time 
being, this specialization of N is irrelevant, and so is the assumption that 
one and the same fibre is erected over every point »€M.) Thus we consider the 
fibre bundle Mx N= {(u,v)|u¢M, v€N},. where (in contrast to the litera- 
ture) we dispense with introducing a topology on Mx N. We also write ay, Wu 
order to indicate that the fibre N over w¢ M is meant. 


Definition. A geometric object is a section of a fibre bundle Mx N, which means 
one-to-one correspondence uC M—>>(u)eN, (Fig. 29.5). 


vip) us 


Tee, 240).5) 


Transitivity. Let {V,; F,}¢., be an atlas of the n-dimensional O~-manifold M, 
and let U,=F,(V,) be the image of V, in R,. Then we extend the mapping 
F, from V, to VzxXJN: 

Fy (wu, )=(Fin, P)ECULXN , (1) 
where FY» is a one-to-one mapping of the fibre N over u¢€M onto the fibre V 
over F,u. In particular F;' exists as a mapping of U,xN onto V,xN. Then 
F.F;'=F,, is a one-to-one mapping of F,(V,0V,)xN onto F(V,AV)xXN 
with 

Pi (Pie, Por) = (Pip, Yr) . 
Jf VzsNViAV;+9 and G=F(V,NV,0V;), then this immediately implies 
the transitivity relation 

HOH a vi — laa) etor (x, 7)eG XN . (2) 
Local geometric object. If M is an n-dimensional C*-manifold with the atlas 
{V;;,}, then let us consider the sets U,x N. We extend the C--mappings 
F,F;' from F,(Vi.NV) onto F(V,NV;) to Fy(ViNV,)xN in a manner anal- 
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ogous to (1): the fibre N over Fu is mapped one-to-one onto the fibre 
N over Fyu, we VV). Tf the mapping extended in this manner is again de- 
noted by F);, and if (2) is satisfied, then {U; x N}¢_, is called a local fibre bundle. 
A section of this local fibre bundle is called a local geometric object. 


Theorem. A geometric object ts transformed by (1) tnto a local geometric object. 
Every local geometic object can be obtained in this way from a geometric object. 


Remark 1. The geometric object which generates a given local geometric object according to 
(1) is of course not unique, because the one-to-one mappings of the fibres V onto one another 
are largely arbitrary. It is also decisive that we have not topologized MxN and U;,x<N. 
For the above theorem is not generally true if topologies are involved, where, for the mo- 
ment, we do not explain precisely what this means. For us it 1s of importance that the theo- 
rem enables us to identify geometric and local geometric objects. From now on we can 
restrict ourselves to local geometric objects given by explicit transformation rules F;, that 
satisfy (2). 


Remark 2. In Chaps. 29 and 32 we shall give a. description of the geometry on manifolds 
which is sufficient for our purposes. More detailed presentations that are addressed espe- 
cially to readers who are interested in physical applications can be found in [1, 9]. We also 
refer to the standard book [33] as well as to [12, 60]. 


29.2.2. Tensor Densities 


As usual, let M be an n-dimensional C»-manifold. If {V,; F;} is an associated 
atlas, then let again U,=F;,(V,) be a domain in R,. We now consider arbi- 
trary U, and U;,/+k, denoting the Cartesian co-ordinates by x* in U, and by 
a’* in U, (Fig. 29.6). Then we write F,F;!' as x’*(x*). As in Subsec. 24.1.2., 
cer 7) 
a(x? ) ( 
summation convention referred to in Subsec. 24.1.2. 


> 0) is the associated Jacobian determinant. Further we again use the 


(x9 

Uy 
x2 (x) xPox'%) 
(x) 

Y, 


Fig. 29.6 
Theorem. Jf w ts a real number, and af RK Ke OA, 2, 2, 08 the above fibre 1, 
then T*’, 4 (where the K parameters a, ...,6,¢,...,d run from 1 to n each) is a 


geometric object if 
P'G.b x’ = me = i q 1 
el ) A(a%) pat Axi ax’ bx Heel ) ( ) 
holds for any pair of local charts U;, and U; of the above form which belong to 
arbitrary admassible atlases (tensor density fields of rank K, with the weight w and 
with Ky, contravariant components a, ...,b and Ky covariant components c, ..., d, 
where K,+Ay=K). 


A(x”) J’ Gx" Ge? oe" Ox" 
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Remark 1. According to Theorem 29.2.1 we need not distinguish any longer between geometric 
and local geometric objects. In this sense we suppose that an 2“-tuple 7-2, a(x) of real 
numbers be given at every point a€U,. If 2’¢ Uy. then, for the purpose of distinction, we 
denote this x*-tuple by T’%-..). q- Then the theorem reads as follows: if (1) holds, then 
the transitivity condition (29.2.1/2) is fulfilled, and we obtain a local geometric object that 
can be transformed into a geometric object according to Theorem 29.2.1. The transitivity 
condition is in effect a consequence of the chain rule. 


Remark 2. If w=0, then we speak of tensor fields, as we did in Subse. 24.1.3. 


Terminology. From now on we shall use the term “tensor” instead of “tensor 


field”, and the term “tensor density” instead of “tensor density field”; cf. 
Subsec. 24.1.3. 


O(a?) \v 
a ao 
tensor density of rank zero (and of weight w). It isa geometric object with N= R,. For w=0 
one obtains fnnetions. If K=1, that means N=R,, then 7, satisfying (1) with w=0, is 
called a contravariant vector, and 7", satisfying (1) with w=0, is called a covariant vector; 
cf. Subsee. 24.1.3., especially the mnemotechnical remark stated there. 


Remark 8. In the sense of (1), we shall call the scalar density me) =( 


29.3. Tensor Analysis 
29.3.1. Fundamental Operations for Tensor Densities 


We are now going to generalize Subsec. 24.1.4. A tensor density is said to be 
of the type (4,7; 2) if it has& contravariant and / covariant components and is 
of the weight w. If w=0, then we write (f, /) instead of (4, 1; 0). Let G, be the 
group of permutations referred to in Subsec. 28.6.2., where (—1)?, p€G,, has 
the meaning indicated there. 


Theorem 1. (a) Zf T*?. 4 and St”, 4 are tensor densities of the type (k, 1; w), and 
if 2 and uw are real numbers, then AT, g+pS* 4 ts a tensor density of 
the sume type. 

(iyi; T°”. 4 is cmtensor dewsily of the type (k),1,; w)), awd if S*-”,., is a 
tensor density of the type (ko, ly; wo), then T%*, gS-?,, is a tensor density 
of the type (ky + ho, Uj +h; wyt wo). 

(c) Tf T*~?, 4 is a tensor density of the type (k,l; w), then 


1 
ESL = a > LN (1) 
‘PES 
and 
i ue 2 
pied) = a Sh Oe (2) 
PEN p 


are tensor densities of the same type. An analogous proposition is true for the co- 


rarvant components. 
i) (Contiaction). if LF" 4 4 ts a tensor density of the type (k,l; w), 


l—1; 2). 


Remark J. Remarks t and 2 of Subsec. 24.1.4. hold analogously for tensor densrttes, too. 
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Remark 2. A tensor density 7-2, g is called symmetrical if 74--?,. g=T7@-™, 4, and 
antisymmetrical if Jl], g=7*->, 4. By way of example, 7%? is symmetrical if 
pe — 74, and antisymmetrical if 72 = — 74. 

Theorem 2. (a) 6%, with 6%,=1 for a=b and 6°,=0 for a+b, ws a tensor of the 
type (1,13; 

(b) 2%? (n indices), with e®?=(—1)? af (a,..., |) =p, «.., 2) Us @ permilla- 
tion of the numbers 1, ..., n, and e*’ =0 otherwise, is a tensor density of the type 
(n, 0; 1) (Levi-Civita tensor density). 

(e) If Tq, ts a tensor density of the type (0, 2; w) with det 7',, +0, then (det T',,)” 
is a scalar density of weight x(nw+2). Here x ts a real number. 


Remark 3. det 7',, is the determinant of (Denis In the sense of (29.2.2/1) one has 


: O(a?) nw +2 
(det i) = (sea) det ie ‘ (3) 
_ _ O(a?) 
Since ae’) the requirement that det 7,,>0 is meaningful and independent of the 


choice of the local co-ordinates. 


29.3.2. Differential Operations 


If pe C~(M), then (keeping the notation) p can be regarded as C*-function on 
the local charts U,=F,(V;,). All the symbols used have the previous meaning. 
7) 2 : E ve 
Asin Subsec. 24.1.3., let e=54 P P= aa etc. It is now easily verified 
that p,, is a covariant vector. On the other hand, ¢,,; is (generally) not a cova- 
riant tensor. 


Theorem. (a) lf 7; is a covariant vector, then Ty,,— Ty, ts a tensor of the type (0, 2) 
(curl, or rotor, of T';). 

(b) If T* is a tensor density of the type (1,0; 1), then T*,, 7s a scalar density of 
the weight w=1 (divergence of T*). 

(ec) If T* is an antisymmetrical tensor density of the type (2, 0; 1) (that is, ae 
T= —T"), then T*, is a tensor density of the type (1, 0; 1). 
Remark. In the above theorem as well as in our further considerations we always assume 
that the components of vectors, tensors, tensor densities etc. are differentiable up to arbi- 
trary order (with respect to the local co-ordinates). As M is a C»-manifold, this assumption 
is reasonable. 


29.3.3. Integrals on Manifolds 


If {V;,; F,} is an atlas for the n-dimensional C»-manifold M, then let {o,}7_, be 
a corresponding partition of unity in the sense of Lemma 29.1.3. As previously, 
let U,=F,(V,). Further suppose that the support of a tensor density 7%", , 
on M be the closure C of the set C, where O consists of all those points mé M for 
which at least one component 7”, 4(F,m) is different from zero (where it is 
assumed that m¢€V,). Since tensor densities are linear geometric objects, this 
definition is independent of the choice of the atlas. 
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Definition. 7f f 7s a continuous scalar density of weight 1 with a compact support on 
M, then 

ffdm= ZS ge) (Pete) de. (1) 


M k=1 k 


Remark 1. As f has a compact support, it can be expeeted that only finitely many integrals 
on the right-hand side of (1) are different from zero. Further, (1) is independent of the 
choice of the atlases and the functions g,. This is in effect an implication of Theorem 9.2.2, 
if one takes into consideration that the Jacobian determinants of the mappings considered 
here are always positive. It is also observed that the weight w=1 is necessary. 


Theorem. /f T* is a (continuously differentiable) tensor density of the type (1, 0; 1) 
with a compact support, then f T*,, dm=0. 
M 


Remark 2. Theorem 29.3.2(b) shows that the above definition is applicable to f= 7"... The 
theorem is in effect an implication of Theorem Qrsetl/2: 


29.4. Affine Spaces 


29.4.1. Affine Transformations 


The symbols in the following theorem have the same meaning as in Subsec. 
222: 
Theorem. Jf Ry: is the fibre N, then If, (where the indices a,b, c run from I ton) 
1s a (nonlinear) geometric object if 

RC at On os, Gx? ay 
el) aes gab age el) + ar aera © 

holds for every pair U, and U, of local charts (affine transformation). 
Remark 1. For the notations, see also Remark 29.2.2/1. To prove the above theorem, one has 
to check the transitivity relation (29.2.1/2). 
Remark 2. It is immediately seen that Te = ie is a tensor (torsion tensor.) An affine trans- 
formation is called symmetrical if Dic= Fy. 
Definition. An affine space consists of an (n-dimensional orlentable) O-manifold 
and a symmetrical affine transformation. 


Remark 3. We thus restrict ourselves to “torsion-free” affine spaces. 


29.4.2. Normal Co-ordinates 


Theorem. /f M is an affine space and if me M, then there existan atlas {V,; Fide, 
tn the equivalence class of the associated atlases and a number l such that meV, and 


eG Oeor laebne=0 did 2,=Fyn. (1) 


Remark. The Cartesian co-ordinates x in the local chart U;=F((V;) are ealled normal co- 
ordinates with respect to %. The existence of such normal co-ordinates is proved by utilizing 
the nonlinearity in (29.4.1/1). The theorem looks harmless, but the normal co-ordinates 
are an extremely efficient tool in affine spaces. 
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29.4.3. Covariant Differentiation 


If ¢ is a function, then ¢,; is a covariant vector. Higher-order derivatives of ¢, 
however, in general do not yield geometric objects. This is also true for arbitrary 
(including first-order) derivatives of tensor densities. Therefore we have to look 
out for a substitute which on the one hand exhibits typical properties of partial 
derivatives, while on the other hand it transforms tensor densities again into 
tensor densities. This is the covariant differentiation, which is denoted by a 
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semicolon ‘; 
Theorem. Jf 77°. isa tensor density of the type (k,l; w) in an affine space, 
then 
Q...D nocd) af] and) po oti 
i Coll 2 = dug C...8'€ +1 ae Bag ir ove ar ele ne (1) 
badd! eeu Tr 1pa...d 
= ae Tic 0) a Pee CoP I Gerth 


us a tensor density of the type (k,1+1; w). 
Remark 1. Here (1) is again understood as a representation in local co-ordinates; the compo- 
nents of 7'%--?, 4 are assumed to be differentiable up to arbitrary order and, as usual, one 
has to take the sum over doubly occurring indices. 
Remark 2. If ¢ is a function, then we also write y,,=9.,. Examples of (1) are 
aa 
gpl ye = ue Af re ae 
for a contravariant, and‘ 
: 
To a Tae eel dl 
for a covariant vector. 


Remark 3. In normal co-ordinates one has 
wel py) ee filles : 
fie b...d 3e(X0) =7? aed) : 


Hence in particular it follows that the usual calculating rules for partial derivatives, espe- 
cially the Leibniz theorem, are also applicable to covariant derivatives. 


Remark 4, From Remark 3 it follows that 6%..=0. 


29.4.4. Translation 


Let M be an affine space, and let U be a local chart of M in R&,,. Consider a 


smooth (once continuously differentiable) curve yin U, which is assumed to be 
kb 


: : : 4 Oe 
given by ic'(z), where 0271, and £— 1, 12,9 (Big, 29. ge The tangent a 
dt 


transformed like a contravariant vector under a co-ordinate transformation 

ae dis *® sor dc! . 

ef sees == = — > —, 
dr Ox! dr 


Definition. (a) A contravariant vector T* is called parallel along y if 


ae 
fis =-=0 jor Pal, a8 ond. Vara, (1) 


e 


(b) A covariant vector 7), is called parallel along y tf 


Bet =9 jor “k= ane 20S7= 1, (2) 
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Wey 29.7 x* (0) Pig. 29.8 


Remark 1. Let us recall that the term ‘vector’ always means ‘‘vector field’, where the 
; dat 

components are assumed to be differentiable up to arbitrary order. As as transform- 

dr 


ed like a contravariant vector, the definitions are independent of the choice of the charts. 
In particular the definition can immediately be extended to curves y that do not lie in one 
chart. 


Remark 2. (1) and (2) are systems of ordinary differential equations of first order for 7", (1) 
yields 


= ee gr (3) 
dr ¢ 
where one has to insert #*=«*(r). Tf T*(x!(0)) is given, then from (3) one obtains a unique 


solution 7*(x!(r)): the vector 7 (x'(0)) undergoes a unique parallel shift along y (Fig. 29.8) 
An analogous result is obtained for (2). 


Teleparallelism, M is called a space with teleparallelism if this parallel shift is 
path-independent. The conventional translation in 2, exemplifies this. Generally, 
however, the parallel shift depends on the path y. 


29.4.5. Affine Geodesies 
Definition. A smooth curve y~.2x*(r) (Fig. 29.9) in the affine space M is culled un 
affine geodesic tf 


dee alia dae (1) 
dr? u dr aueee 


for k=1,..., n and for a suitable parumeter t. 


da* 


Remark 1. If 7* is a contravariant vector that coincides with along y (there will al- 


dr 
ways exist such vectors), then (1) and 
dx! i 
*,——=0 along y (2) 
sae: 


are equivalent. This follows from (29.4.4/3). Roughly speaking, this means that y is an 
kt 


i ic I 7 i sovariant ent vector -— is parallel alone y. 
affine geodesic if and only if the covariant tangent vector dq 8 Pa allel along 4 


x*(r) 
gk 


Fig. 29.9 Xo Fig. 29.10 
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Remark 2. (1) is a system of ordinary differential equations of seeond order. If U is a local 
chart, and if zj¢U and é*¢€ R,, then locally there exists exactly one affine geodesic x(t) = 


da* 
=(a#(z)) such that 2(0) =a) and —(0) = (Fig. 29.10). 


Remark 3. If a=o(r) is a strictly monotonic, differentiable function, then y ean also be 
parametrized by o. Then (1) reads 


deh Sande de ea de” 
lo! 2 0e oe Aene 


Tf one wants to have the form (1) again, with ¢ instead of t, then this requires that a” =0. 
Such parameters are called affine or geodesic. 


Theorem. ¢ is a geodesic parameter for the affine geodesic y if and only if o=et +d. 
Here t is the parameter of (1), e and d are real numbers, c+0. 


29.4.6. Riemann’s Tensor 


8 
Theorem. /f M its an affine space, then 
BM ca = Use = Uoare +L Gel be — Ver! ta (1) 
as a tensor of the type (1, 3) (Riemann’s tensor). If T* ts an arbitrary contravariant 
vector, then 
6 . 
Poca — TY 3c = B yeqT™ » (2) 
Remark 1. If 7%,,=S*,, then 7,7 =S*%,.q means twice repeated covariant differentiation. 
Remark 2. (2) is the starting point, whereas formula (1) gives an explicit expression, which 
0? 0? 
is, however, not so very interesting. As is well known, —_—j=———— . Hence B%,., in 
se : Cnn aoe se 


(2) is a measure of the degree to which the covariant differentiation differs from the partial 
one. An affine space JZ is called locally Euclidean if [%,,=0 is achievable with a suitable 
choice of an atlas. Then 6%,,4=0, the covariant derivative and the partial one coincide. In 
the general case %;,q expresses the degree to which J differs from a local Euclidean space, 
i.e., the “curvature” of A7. One might attempt to replace JT in (2) by a scalar function f. 
However, it is easily seen that f.9,=f.5q. Hence a deviation from the familiar eommutative 
law is encountered for the first time for vectors. 


Remark 3. In normal co-ordinates for x, (1) reduees to 

Be yea %o) = 1 e-a{@o) —L5arc(o) - (3) 
Symnetries: Using (3), one shows that 

Bsa = — B hae 5 

B ycay=9 = Ba + B cay + Bra, » 

ca — 0 = WS cata D nae pecsue (4) 


(4) is called Bianchi identity. 
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29.4.7. Flat Affine Spaces 


Definition. Let M be an affine space. 

(a) M is called flatif B*,.4=0. 

(b) M ts called locally Euclidean 7f in the equivalence class of the associated atlases 
there exists an atlas {V,; Fy}~, such that =) jo 260, — i (Ve 

(e) A contravartant vector T* is called constant if T*1=0 for k=1,...,n and 
ae 


Remark. The terms (a) and (b) express additional properties of J that make AZ closer re- 
semble the Euclidean space R,. Let us recall the concept of teleparallelism mentioned in 
Subsec. 29.4.4. With 77%,,=0, R,, is flat, locally Euclidean, and exhibits teleparallelism. 
Moreover there exist nontrivial constant vectors. 


Theorem. If M is an affine space, then the following four propositions are equiva- 
lent. 1. M is flat; 2. M 7s locally Euclidean; 8. M exhibits teleparallelism; 4. If U 
as a local chart, and if x9€ U and €* € R,, then there exists exactly one constant contra- 
variant vector T* such that Ces, 


29.5. Metric Spaces 


29.5.1. Fundamental Tensor 


By a C*-manifold we shall always understand an n-dimensional orientable 
connected manifold (cf. Def. 29.1.2) without continually referring to these ad- 
ditional properties. On such a manifold M, let us consider a tensor gy, of the 
type (0, 2) with the properties 


Gu=Jx (symmetry) and det g,,+0. (1) 


Suppose that the components of g,,; are differentiable up to arbitrary order. 
det gy is the determinant of (g,1)f;-1, which is a scalar density according to 
(29.3.1/3). Henee it follows that (1) (including the symmetry) is independent of 
the choice of local charts, and hence meaningful. If mé M, then it is possible to 
find an admissible atlas {V,; F,}¢_,, with m€ V,, such that g,; (being interpreted 
as a symmetrical matrix) assumes the normal form 

1 0 


Mk 
(Gi(Xo)) = = in (2) 
S_ 


S4 


0 “1 

at the point x,=F,m. This is a proposition of analytical geometry. In this case 
s, and s_ (the numbers of the 1’s and —1’s) are invariant: every admissible 
transformation that leads to (2) yiclds the same numbers. For reasons of con- 
tinuity it follows that s, and s_ are also independent of xg, and henee represent 
characteristic numbers on M. The terms=s, —s_ =2s, —n is called signature. 
Two cases are of special interest: 

Ss=oi : positive definite metric , 

=n —2: Lorentz metric , 
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where gj; 18 also called metric on M. Hf y,, is a Lorentz metric, let us consider the 
zero manifold of 


n—-| 
Gilt) ES! = 2 (é7)2—(€")2 in R,. 
— 
It is a circular cone. 
n—1 


Conoid (Fig. 29.11): A conoid is obtained from the cone > (&")2=(€")2 by a one- 


all 
to-one mapping 7 = 7(£) that is defined and differentiable up to arbitrary order in 
a neighbourhood of 0€ #,,. 7(0) is called the origin of the conoid. 


(3) gg 


i 


v * 
a 


| 
| Fig. 29.11 


Congruence of Lines (Fig. 29.12): If M isa C=-manifold with the Lorentz metric, 
then there exists a local conoid at every point x€ U;, of the local charts G,=f(V 7). 
The directions of the local axis of symmetry at the origin of the conoid generate 
on M a congruence of lines that is free from singularities and covers M simply 
and without gaps. ' 


Theorem. Let M be a C»-manifold. 

(a) Zn any case there exists a positive definite metric on M. 

(b) There exists a Lorentz metric on M if and only rf there exists a (simple, gup- 
less) congruence of lines that is free from stngularities. 

(c) Lf M is not compact, then there exists u Lorentz metric. 


Remark 1. According to (b) there is no Lorentz metric on a sphere in 3, whereas there exists 
a Lorentz metric on a torus (see Fig. 29.13). 


Remark 2. 9;; with the property (1) is called fundamental tensor. Here 9x1 is thought to be 
fixed on M. Confer Subsec. 24.1.2. for reference. j 


(x) (x') 
x(x) CG 
EBB 2 
x(x) CM Pie. 29.13 


29.5.2. Index Shifting 


Let M be a C*-manifold with a given fundamental tensor g,7. 
Lemma. Ff g!"7s determined from g"qim = 5° n, then g* is a tensor of the type (2, 0). 


Remark. Confer Theorem 2(c) as well as Remark 3 in Subsec. 24.1.4. 


or 
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~ oad 
La = ee 


Index shifting. If ae ar is a tensor density, then we consider the tensor den- 
sities 

js ee aa Gorm aps EE, == gy 
to be not essentially different from T*~ ;. By means of the fixed tensors Jad 
and g”’ it is thus possible to raise and to lower indices. Then there are covariant 
and contravariant versions of tensor densities etc. These procedures of index 
shifting must always be stated explicitly in each particular case, including the 
points where the indices have to be placed, in order to avoid nusunderstandings. 


29.5.3. Characteristic Surfaces 


Definition. /f M is a C>-manifold with a Lorentz metric g*, and if w(x) is a O°- 


n 
function in M, then Q={xr| w(x)=0}, with D> w3,(x)>0 for x€Q, rs called a 
r= 
characteristic surface if g(x) w(x) wn(xc)=0 for rE Q. 
Remark 1. This is a definition in local charts. Since w,, is transformed like a covariant vec- 
tor, it is independent of the choice of the local charts and can in particular be transferred 
from one local chart to another. Hence the above definition is a global one. 


Theorem 1. /f U is « local chart, then for every point xy€U there exists exactly one 
conoid with origin x, that is a characteristic surface (characteristic conoid). 


Remiark 2. x, does not belong to this characteristic surface, because the latter has a singu- 
larity at this point. 
Remark 3. If JJ=,. where gj; has the form (29.5.1/2) with s_=1, then the cones 

al 


O=aleexye= 2 (z*)2?—(27)2 
aa : 


are the desired characteristic conoids. For n=4 (Minkowskian space) one obtains the light 
cones of the special theory of relativity. 


Rays. If x,+ x, is a point on the characteristic conoid for x», then the character- 
istic conoids for xy and x, touch each other along a C*-curve xr‘(z) (Fig. 29.14). 
These curves are called rays. Thus one obtains a distinguished congruence of lines 
on the characteristic conoid. In the case of the cone referred to in Remark 3, the 
curves in question are straight lines through the origin of the cone. 


al ie 


YW 


0 


Fig. 29.14 
2 er dz* dx' ( 
Theorem 2, If x"(t) ts a ray, then gr oe os 2; 
ys é ike i ravariant vector, the above proposition is 
Remark 4. As ae is transformed like a contravarian ‘ 
meaningful (independent of the co-ordinate system). 


23* 
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29.5.4. Metric Geodesies 


If M isaC*-manifold with a given fundamental tensor gj, then the Christoffel 


symbols {jk, 7} and 12} have the same meaning as in the formulas (2) and (3) of 
Subsec. 24.1.5. 


Definition 1. 7f M is an n-dimensional C»-manifold with a Lorentz metric, then 
every C™-curve x"(s) that solves the system of differential equations 


d2x" in fa da? dx’ 0 
ds? \be ds ds’ 


is called a metric geodesic (line). 


GaN, eee Bg (1) 


Remark 1. This definition is an extension of Theorem 24.1.5 to manifolds with a Lorentz 
metric. Of course it is also possible to generalize the definition to manifolds with arbitrary 
fundamental tensors, but it is of no interest for us to do so. 


Remark 2. If U is a local chart, and if a¢€U and &*€ R,, then locally there exists exactly one 


da* 
metric geodesic «(s) ~#*(s) such that #(0) =a) and Fee oS 
Lemma. /f x*(s) is @ metric geodesic, then there exists a real number ¢ such that 
dx* dx’ 
x’(x)) ——-=c. 2 


Definition 2. A metric geodesic is called time-like if c<0, null geodesic if c=0, and 
space-like (geodesic) if c=0 in (2). 


Remark 3. If x¢U (chart), then a contravariant vector & is called time-like (at the point x) 

if ger(x)S*s'<O, null if gyg*é4=0, and see sale if gyi(w)£*é! > 0 (Fig. 29.15). Hence a metric 
di 

geodesic is time-like if its tangent vector Fr is time-like. Analogous statements are true 

for the null and space-like properties. Time-like vectors point to the interior of the charac- 

teristic conoid. 


Remark 4, The time-like metric geodesics (and, with a suitable modification, the space-like 

ones too) can be derived from a variational principle, as was done in Subsec. 24.1.5. If 
da* 

a(t) ~2*(r) is a time-like curve, i.e., if — is always time-like, then let 


dt 


fa da* da! 
Large, a I —9xi((7)) FE a dr (3) 
% 


be the are length of a(r) (Fig. 29.16). Further let %=#(To), e,=2(t,), and to<7,. It is easily 
seen that (3) is invariant under co-ordinate transformations a/* =a'*(!) and under parame- 
3 F k 

time-like te 


ene sy 
§ 


X 


space -like 


Tei, 2) 1155 Fig. 29.16 
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PS ee 
ter transformations t=7(s). s can be so chosen that (2) holds with c= —1. Then ZL, = 
ees 3 . ee : : tom 
i Sil, ie: sis the are length. It is unique except for 8>8+8 and s+ —s+ 5p; sp real. As 

1 5 

in ubsec, 24.1.5., one can now look for curves x*(z) of extremal arc length between two 
points. This is a variational problem, and as a result one obtains (by analogy with Theo- 
rem 24.1.5) time-like metric geodesics. 


Remark 5. Metric geodesics are identical with the solutions of the following systein of dif- 
ferentiale equations (canonical form): 


dx* OH dé, oH Le 
ds 0% ds aah? oi Sets. ) 


Theorem. The null geodesics coincide with the rays of Subsec. 29.5.3. 


29.5.5. CGeodesically Convex Domains 


Definition. /f M isa C-manifold with a Lorentz metric, then an open set Din M is 
called geodesically convex (or extremal-convex) tf any two points of D can beconnect- 
ed n D by exactly one metric geodesic. 


Remark 1, This is the analogue to the concept of convex domains in R,,. 
Theorem. Every point of M has a geodesically convex neighbourhood. 


Remark 2. The theorem is plausible but not trivial. 


29.5.6. Metric Spaces 


. ; ; ; a 
Lemma 1. /f M is a C-manifold with the fundamental tensor gj, then ['2, = 

: : Rye 8 be 
is an affine transformation in the sense of Theorem 29.4.1. 


Remark 1. (ea is the Christoffel symbol of (29.1.5/3). A comparison of the affine geodesics 


of (29.4.5/1) and the metric geodesics of (29.5.4/1) now suggests the following definition. 
Definition 1. A metric space is an (n-dimensional orientable) C@-mantfold with a 


a 
fundamental tensor g,, and the (symmetrical) affine transformation Ij, = ta j 


Remark 2. Consequently, metric spaces are special affine spaces (this definition differs from 
the usual terminology). All the considerations on affine spaces of Sec. 29.4. can also be 
applied to metric spaces, especially the concept of the covariant derivative. Since affine 
and metric geodesics are identical, from now on we shall only speak of geodesics. The term 
“metric space” is customary, but should not be confused with the metric space of Subsec. 
1.3.3. A Lorentz-metric space is a metric space with a Lorentz metric. 


Mere tage, 7—9 4=0 7, = 0. 


Remark 3. This means that the fundamental tensor behaves like a constant under covariant 
differentiation. 


Lemma 2 (normal co-ordinates). If M is a Lorentz-metric spuce, and if me M, then 
there exist an atlas {V,; F,} in the equivalence class of the associated atlases and « 
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number | such that meV, and 


1. 0 
Yar(Xo) = = 1 > Gavre(Xp) = 9, ay=Fym . 
0 —1 
: : fa] ? ty : : 
Remark 4. Tt is easily seen that then \be (%) =0. This connects our considerations to the 
normal co-ordinates of Subsee. 29.4.2. 


Definition 2. A Lorentz-metric space is culled metric-flat if, in a suitable atlas, 
1 0 

Farle) =| 9 a a : ’ 

Theorem 2. A Lorentz-metric spuce is metric-flat if and only tf it ts flat. 


Remark 5. This is a supplement to Subsec. 29.4.7. 


29.5.7. Riemann’s Tensor and Related Tensors 


In a metric space we write eq = BY,-q for the Riemann tensor of Subsec. 29.4.6. 
Further let 


Eigen =Gar le’ bea 3 
Theorem 1. 
Lavea = SE Lnsee ae Le = Tira ’ 
Fatoca] =0= ie ar Tvncar = The ’ 
Ravfedyey == Rarcaie + Rasaere + Rater a (Bianchi’s identity) . 
Remark 1. The last two formulas are derived from Subsec. 29.4.6. 
Remark 2. In view of the symmetry relations, the number of independent components of 
1 
Ravea reduces to 12 n? (n2—1). For n=2, all the components of Hz;,q can be expressed by 
Fiato. 


Definition. Ricci’s curvature tensor rs defined as Ra, = R' ay or R°,=9"R,,. The 
curvature scalar 18 defined as R= R*,. Einstein’s tensor ts defined as Gy= 


il 
he == ga, or CG", =9"G,,. - 


Theorem 2. One has Ryy = Ryq (and hence also Gan =Gya). Moreover, G%, =) 


Remark 3. G%.,=0 is a fundamental equation. It forms the basis of the general theory of 
relativity. 


Remark 4. The Riemann tensor and the Ricci tensor have the forms 


Favea= % (Gacrba + Yoarac — Gadde —Jocrad) 


+9"*({ac, 7} {bd, s} ~{ad, P} Woe, PY) . 


Ba esha href 0} {0 ff s 


Hence Raya, Ray, R, and Gay are differential expressions of second order, which are linear 
in the derivatives of second order, but not in Jap and its first derivatives. 
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Einstein on the concepts of space and time: 
“which the physicists, under the stress of facts, had 
to get down from the Olympus of a-priori in order 
to repair them and to restore them to a usable 
condition.” 


30. General Theory of Relativity T (Fundamental Equations ) 


30.1. Variational Principles 
30.1.1. Lagrangian Formalism 


How to derive field equations from variational principles has been described 
in detail in Sec. 24.2. Therefore we restrict ourselves to a few remarks which 
link those previous considerations up to Chap. 29. Given a O*-manifold M which 
shall be transformed by a fundamental metric tensor g,; to give a metric space in 
the sense of Def. 29.5.6/1. In contrast to the absolute space-time theories of 
Subsec. 24.2.1., however, gg; is not given but rather a dynamic object. But Jt iS 
required to be always a 4-dimensional Lorentz metric in the sense of Subsec. 
29.5.1. (n =4 in the notation used there). Hence it is general space-time theories 
which are considered. The investigations described in Sec. 24.2. were local ones. 
Consequently the change from the domain 2 fixed there to an arbitrary O*-mani- 
fold M does not raise any difficulties. Lagrange densities for curves are defined in 
the same way as in Def. 24.2.2(a). For time-like curves, (24.2.2/4) represents an 
example we have already made use of in Remark 29.5.4/4. Def. 24.2.2(b) is ex- 
tended in an obvious way as follows. A Lagrange density for geometric objects is 
a scalar density of weight w=1 (cf. Subsec. 29.2.2.), having the form 


r r 
koorpkd Secel! 
L= L(x me Podlt 2 ap poste , 


; 5 
where 7*, are the components of geometric objects (tensor densities and 
affine transformations being sufficient), 7=1, ..., W. The requirement that L bea 
scalar density of weight w=1 is identical with (24.2.2/2). If g,) is a Lorentz met- 
ric, then, according to Theorem 29.3.1/2, 


L=V—g with g=det gy 
is a Lagrange density; cf. (24.2.2/5). Then (24.2.2/8) is also an example of a 
Lagrange density, @” and H;, being tensors as defined there. The Lagrangian for- 
malism can be taken over from 24.2.3. without any change. In particular the 
Euler-Lagrange equations (24.2.3/6) are applicable. 


30.1.2. EHinstein’s Equations 


tiven a 4-dimensional manifold M, a Lorentz metric gy ;, and an affine trans- 
formation If, on M. Let A be a real number, then 
Ww 


L=V—g(R-2A) with R=g"Cha—Ceat lelh —CE)) 
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is a Lagrange density with the dynamic objects g and I¢,: from Theorem 
29.4.6 and the usual contraction procedures it follows that F is a scalar. 


Theorem. The Kuler-Lagrange equations (24.2.3/ 6) read 


Ph= |p). OM + Apt =0. (1) 


Remark 1. ‘eal is the Christoffel symbol used in Subsec. 24.1.5. and G*! denotes the Einstein 


tensor as defined by Def. 29.5.7. This skilful decoupling of the Lorentz metric gxy and the 
affine transformation J}, is called Palatini’s method. Naturally it is hoped that the 
“correct” affine transformation Ty, is then established automatically, so that M7 is a metric 
space in the sense of Def. 29.5.6/1. Then, according to Subsec. 29.5.7., R is the usual 
curvature scalar. 


Remark 2. The second set of equations in (1), i-e., 
i 
RY _— Rgk + Agi! —0 (2) 


are the famous Einstein field equations for the vacuum. With A=0, these equations 
(including the energy-momentum tensor referred to in Remark 3) were set up by Einstein 
and Hilbert almost simultaneously in 1915. The elegant derivation from a variational 
principle is due to Hilbert. Later on Einstein added the term Ag" with the cosmological 
constant A in order to avoid (as he believed) undesirable cosmological consequences. He 
believed in a static (time-invariable) universe (and so did, presumably, all the physicists of 
those days). Formula (2) with A=0 did not lead to this result. After Hubbles’ discoverios, 
which suggested an expanding universe, Einstein regarded the introduction of the cosmo- 
logical term Ag"! as a serious error. 


Remark 8. In the general case, Einstein’s equations read 
veal 
SS Rg! + Ag! =xT*! : (3) 


where 7*! denotes the energy-momentum tensor. It includes all external influences such as 
masses, electromagnetic fields etc. x is a coupling constant. (29.5.7./1) shows that (3) is 
a set of 10 partial second-order differential equations. It contains the second derivatives 
of gy; linearly, in contrast to ggi and their first derivatives. (3) also shows that the geo- 
metry (given by g*’) and matter (given by 7") are closely coupled: Mach’s principle. 


30.1.3. Kinstein-Maxwell Field Equations 


The question arises whether the general equations (30.1.2/3) can also be deduc- 
ed from a variational principle. In special cases this is possible. For an elec- 
tromagnetic field, the Maxwell-Lorentz equations had been derived from a 
variational principle in Subsec. 24.3.2. The function given in (24.3.2/1) is also a 
Lagrange density in the sense of Subsec. 30.1.1. (cf. Theorem 29.3.2 (a)). Thus, 
keeping the notation of Subsecs. 24.3.2. and 30.1.2., it suggests itself to combine 
the Lagrange densitics used there, 


— R fee lel 
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with 
Rag" (Vbor— Mg + P40, - Dl) 


Again L is a Lagrange density. The dynamic objects are Gl eons 
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Theorem. The Lagrange equations (24.2.3/6) read as follows: 


Py = Ap, — Ap, , (V~g Fe) =0, (1) 
a Seno! : a 
7 el We) 4g! eT" (2) 
where 
ee i orn 
7h = PEP —— G EE ee (3) 


Remark 1. The first set of equations in (2) again represent the “correct” affine transfor- 
mations so that VW again becomes a Lorentz-metric space. The equations (1) are the Maxwell 
equations of Subsec. 24.3.2. in curved space-time coordinates. (3) is the usual energy tensor 
for the Maxwell field, if the interpretation of (24.3.3/1) is used here, too. Then the Ein- 
stein-Maxwell equations are made up of the second sets of equations in (1) and (2): this is a 
coupled set of 14 partial differential equations for g*! and F*!, where F;; has the structure 
of the first equation in (1). In particular it is seen that the Lorentz metric 9x1 Of the space 
is influenced by the eletroemagnetic field. 


Remark 2. As F’ is antisymmetric and V—g is a scalar density of weight w=1, it follows 
from Theorem 29.3.2 that 


<a (V—g F*),,. (4) 


Observe that the contravariant vectors on both sides of (4) coincide in normal co-ordinates 
(cf. Lemma 29.5.6/2). Hence, the second set of equations in (1) can also be written as 
F*'=0, These are the Maxwell equations in the vacuum (without electric charges and 
currents). If charges and currents are present, then the current density j and the charge 
density 9 referred to in Remark 24.3.3/3 are combined into the four-vector of current, 
o*, The Maxwell equations then read as follows: 


ao" (5) 
Remark 8. If it is assumed that F*! influences the Lorentz metric only insignificantly, then 


the Einstein- Maxwell equations can be decoupled: in this case (5) and the first equation of 
(1) are considered for a given Lorentz metric g;; (and a given four-vector of current). 
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Newton, you found the only way which, 
in your age, was just about possible 

for a man of highest thought and 
ereative power. (A. Einstein) 


Einstein during a lecture in Pasadena, 1930. 


30.1.4. Some Remarks Einstein Made on Relativity aud Quantum Theory 


In ref. [43], § 17.7, one may find quotations from fundamental papers by 
Einstein and Hilbert, which illustrate the creation of the general theory of 
relativity. Some of Einstein’s most important and most famous papers on this 
subject are included in [84]. In a very impressive way these papers illustrate the 
final phase of the struggle for a definitive formulation of the fundamental equa- 
tions of general relativity, i.e., of Kqs. (30.1.2/3) (with A=0). Here we quote 
some passages. (We have changed part of the notation to fit it to that used in 
this book.) In “On General Relativity” (Prussian Academy of Sciences, Proceed- 
ings, Nov. 11th, 1915) Einstein wrote: 

“Like special relativity is founded on the postulate that its equations shall be 

covariant under linear orthogonal transformations, the theory to be described 

here rests on the postulate of the covariance of all sets of equations under 
transformations with the substitution determinant 1. Hardly anyone who has 
really comprehended this theory will be able to resist its fascination; it means 

a true triumph of the method of the general differential calculus established 

by Gauss, Riemann, Christoffel, Ricci, and Levi-Civita ... On the basis of the 

statements made so far, it suggests itself to set up the field equations of gravi- 
tation in the form 


0 anf 
R =KT ys 


fe) 


because we already know that these equations are covariant under any trans- 
formations with determinant 1.” 
(Here R°, means part of the Ricci tensor Fy)» In “On General Relativity 
(Supplement)” (Prussian Academy of Sciences, Berlin, Proceedings, Nov. 18th, 
1915) Einstein refers to the preceding paper as follows: 
“In a recently published investigation I have shown how a theory of gravita- 
tional fields can be founded on Riemann’s covariant theory of multi-dimension- 
al manifolds. Here it shall now be shown that an even stricter logical con- 
struction of the theory can be achieved by introducing an additional hypothe- 
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sis, indeed a bold one, on the structure of inatter.. . In what follows we assume 


that the condition 7,=0 is in fact generally satisfied... If we now stipulate 
the field equations of gravitation to read 
Hage as ie ’ 


we have thus obtained generally covariant field equations.” 
At last, in “The Field Equations of Gravitation” (Prussian Academy of Sciences, 
Berlin, Proceedings, Dec. 12th, 1915) Einstein reaches the final goal of his 
efforts for setting up the general field equations: 
“In two recently published communications I have shown how to obtain field 
equations of gravitation which ineet the postulate of general relativity, i.e. 
which, in their general form, are covariant with respect to arbitrary substitu- 
tions of the space-time variables. Here the course of development was as 
follows: initially I found equations which included Newton’s theory as an 
approximation and were covariant with respect to any substitutions with deter- 
minant 1. Then I found out that, if the scalar of the energy tensor of “matter” 
vanishes, there are generally covariant equations corresponding to those men- 
tioned above. The co-ordinate system then had to be specialized by the simple 


rule that ¥—g must become 1, which leads to an eminent simplification of the 
equations of the theory. Here, however, as mentioned above, it was necessary 
to introduce the hypothesis that the scalar of the energy tensor of matter van- 
ishes. But lately I discovered that it is possible to do without that hypothesis 
on the energy tensor of matter if the energy tensor of matter is inserted in the 
field equations in a way slightly different from that stated in my previous 
communications ... We set 


hy = % [Pin tn? (1) 


where g?°T,,=T°,=T ... This at last completes the general theory of relativ- 
ity as a logical system. The postulate of relativity in its most general form,. 
which makes the space-time co-ordinates physically insignificant parameters, 
leads, of compelling necessity, to a well-determined theory of gravitation, 
which explains the advance of perihelion of the Mercury.” 

It is not difficult to see that (1) is identical with (30.1.2/3) for A=0. In “‘Geo- 

metry and Experience” (Prussian Academy of Sciences, Berlin, Proceedings, 

Feb. 3rd, 1921), Einstein comments on the role of mathematics as follows. 
“Above all other sciences, mathematics is held in high estimation for one 
reason: its theorems are absolutely certain and indisputable, whereas those of 
all other sciences are, to some degree, disputed and always in danger of being 
invalidated by recently discovered facts ... But that high estimation of 
mathematics, on the other hand, rests on the fact that it is also mathematics 
that provides the exact natural sciences with a measure of certainty which 
they could not achieve without mathematics ... As far as the theorems of 
mathematics refer to reality they are not certain, and as far as they are cer- 
tain they do not refer to reality . .. The reason why I put a special significance 
on this conception of geometry described above is that without this conception 
it would have been impossible for me to establish the theory of relativity. For 
without it the following consideration would have been impossible: in a frame 
of reference which rotates with respect to an inertial frame, owing to Lorentz 
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contraction the laws of placement for rigid bodies do not correspond to the 
rules of Euclidean geometry; consequently, if non-inertial frames are allowed 
as equivalent frames Euclidean geometry must be abandoned.” 

The following quotations from Einstein have been taken from [13], and were 

extracted from Einstein’s personal letters. 

“The actual object of my research was always the simplification and unifica- 
tion of the physical theoretical system. I achieved this object satisfactorily for 
the macroscopic phenomena, but not for the phenomena of quanta and the 
atomistic structure. I hold that the modern quantum theory, too, in spite of 
considerable successes, is still far from a satisfactory solution of the latter set 
of problems.” (1932) 

“Coming from the sceptical empiricism of, say, Mach’s school, by the problem 
of gravitation I was made a believing rationalist, i.e, one who seeks the only 
reliable source of truth in mathematical simplicity.” (1938) 

“Belief in the conceivability of reality by something logically simple and co- 
herent ... It seems hard to get wise to the Lord’s game. But that he plays at 
dice or makes use of ‘telepathic’ means (as is suggested to him by the present 
quantum theory) I cannot believe for a minute. (1942) 

“The more one hunts after the quanta, the better they hide.” (1924) 

In a talk with E. Straus (Einstein’s assistant from 1944 to 1948), Einstein said: 
“What really intrigues me to know is whether God could have made the world 
differently; that means whether the requirement for logical simplicity leaves 
any freedom at all.” 

Finally we mention the following version of the last quotation: 

“T want to know how God created this world. I am not interested in this or 
that phenomenon, in the spectrum of this or that element. I want to know His 
thoughts, the rest are details”. 


Remark 1. Hilbert found the general field equations (1) (to which (30.1.2/3) corresponds 
with A=0) almost simultaneously with Einstein in November, 1915. The deduction of 
(30.1.2/3) from a variational principle, as was indicated in Subsec. 30.1.2., corresponds to 
Hilbert’s approach. Hilbert always pointed out that the priority is due to Einstein. and 
that his investigation founded on Einstein’s earlier work. Nevertheless it is strange that 
Hilbert’s important feat is always referred to only on passing (as was done also here indeed). 


Remark 2. “Jf Raphael, returned to life, were to paint a modern School of Athens of physi- 
cists, T think he would include Einstem with Galileo, Newton, and Maxwell pointing toward 
the heavens, while Faraday and Rutherford would point toward the earth” (KE. Segré, 
[57, p. 78]). 


6 


30.2. The Energy-Momentum Tensor 
30.2.1. Killing Vectors and Laws of Conservation 


The object of Sec. 30.2. is to investigate the tensor 7” and the coupling con- 
stand x from (30.1.2/3) more closely. Some preliminary considerations are useful 
for that purpose. From (30.1.2/3) and Theorem 29.5.7/2 it follows that 


C2 Sandia (1) 


are necessary conditions for the energy-momentum tensor. 


Oh 30.2. The Bnergy-Momentum Tensor Sd 
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Definition. The contravariant vector K' is called a Killing vector i ares Ce) 
holds for all r and s. 


Remark 1 As was stipulated previously, we speak of vectors rather than (more exactly) 
of vector fields. 


Theorem (Law of conservation). If the conditions (1) are satisfied and K is a Killing 
vector, then 


(V¥-gP"),= where PP T"'K,. 


Si ee oy oe : a 
Remark 2. P*., =0 can be verified immediately. Then the proposition follows from Theorem 
29.3.2(b) and a consideration using local co-ordinates. This differential law of conservation 
can easily be transformed into an integral one by utilizing the Gaussian theorem 9.3.1/2. 


Remark 3. Non-trivial Killing vectors thus lead to (differential and integral) laws of conser- 

vation. The existence of non-trivial Killing vectors, however, is always connected with 

symmetrics in the given metric space. An arbitrary Lorentz-metric space does not have 

any non-trivial Killing vectors. There are at most 10 linearly independent Killing vectors 

n(n-+t) 
2 


= 


{at most vectors in an n-dimensional metric space). The maximum number 10 is 


assumed if and only if the space has a constant curvature, i.e., if 


R 
Rovea = Ee (Jac9ba —JocYaa) - 


Reniark 4. In the Minkowskian space, the 10 linearly independent Killing vectors are given by 
Sa=Cgt fap’. €ab — — ba + 


where c, and é,, are constants. 


30.2.2. The Covariance Principle 


If at some point of a 4-dimensional Lorentz-metric space normal co-ordinates 
in the sense of Lemma 2 and Remark 4 in Subsec. 29.5.6. are chosen, then at 
that point covariant derivatives become partial ones, and in a neighbourhood of 
this point one has approximately the same conditions as in the Minkowskian space. 
Thus it suggests itself to regard normal coordinates as approximately represent - 
ing local inertial frames in the sense of the special theory of relativity; cf. Sub- 
sec. 25.1.1. This suggests the following approach to the derivation of physical 
laws within the general theory of relativity. 


Comma-semicolon rule (covariance principle). Find the special-relativistic formula- 
tion of those physical laws which have nothing to do with gravitation, and then sub- 
stitute covariant derivatives for partial ones (i.e., semicolons for commas). 


Remark 1. The Maxwell equations F”', =0 of (24.3.2/3) and F*)=0 of Remark 30.1.3/2 
represent one cxample. 


Remark 2. In the comma-semicolon rule, the same difficulties occur as in the quantization 
rule of Subsce. 28.3.1.: the translation is done from a commutative range to a non-commu- 
tative one. In the case of the quantization rule, (commutative) real numbers are trans- 
formed into (non-commutative) operators, while in the case of the comma-semicolon 
rule (commutative) partial derivatives are transformed into (non-commutative) covariant 
ones. Thus in differential expressions of second and higher orders one has a great number 
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of variants. To choose the “correct” one is a matter of physical intuition: it is not possible 
indeed to develop a higher theory from a lower one in a purely deductive way. 


Remark 3. The above considerations and Remark 30.2.1 /4 show that the differential laws 
of conservation of the special relativity theory hold approximately in the general theory of 
relativity. The basis of this was the existence of Killing vectors and the requirement that 
7T*' ,=0 in the Minkowskian space. In the sense of the rule stated above, 7*t »=0 is trans- 
lated into 7*!,;=0. Thus a physical motive for (30.2.1/1) has been found: it is desired that 
the differential laws of conservation of the special theory of relativity may hold approxi- 
mately in the general theory of relativity. 


30.2.3. Energy-Momeutum Tensor for Ideal Liquids 


Naturally, the covariance principle will also be extended to the energy-mo- 
inentum tensor 7!, where x in (30.1.2/3) is a coupling constant to be determin- 
ed later. It appears at least plausible to treat world lines of particles (e.g., gal- 
axies) in a curved four-dimensional space-time in the sense of the covariance 
principle according to the classical laws of ideal liquids, and to establish the cor- 
responding energy tensor. Let V* be the contravariant representation of the 
tangent vector of the time-like world line of a flowing particle (Fig: 30.1), then 
we normalize V4 V,= —1. The energy tensor then reads 


Th =(utp)VEV + pg, (1) 


if 
VA Fig. 30.1 


where the energy density » and the pressure p are scalar functions (uv includes the 

mass density 9). As was already indicated, this all happens in a 4-dimensional 

Lorentz-metric space with g,; as a fundamental tensor. From (30.1.2/3) and 
T*'..=0 one obtains 


Res Rg" + Agt=xT" ; (2) 
MyV'+(u+p)V",=0, oie (3) 
(utp) V*aV' + (g¥ + EP) py=0. (4) 


This fundamental system underlies many local and cosmological considerations. 


30.2.4. Comparison with Newton’s Theory of Gravitation 


The tensor 7’ in (30.1.2/3) is established on the basis of classical energy-mo- 
mentum tensors according to the covariance principle. What remains is the 
determination of x. To this end, consider the Newtonian limiting case: for a 
given mass density 9 in Rs, T',,=c2, in the CGS-system, is the dominant compo- 
nent in the energy tensor (¢ is the velocity of light), where we suppose that there 
exist no electromagnetic fields etc. which make contributions to the energy ten- 


a 
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sor. Let the Lorentz metric be g,;= 4) +f, where Ny 18 the metrie in the Min- 
kowskian space (cf. (25.1.1/1)) and f,; is a small perturbation. If this set-up is 
introduced in (30.2.3/2) for .1=0, then an approximate calculation pives “7 
= —xoc?, where 1 is the (three-dimensional) Laplace operator. The Newtonian 
gravitational potential ® is known to be determined from 4® =4nGo, where G is 
the Newtonian constaut of gravitation. It turns out that one has to set @= 

2) 


c 
=-— 3 hi. Then a comparison of the two potential equations yields Kinstein’s 


constant 
SrG 
x=—-=2:-10° 8 g-!lem-!s2, 
ct 


just a tiny value! For details refer to any serious textbook of General Relativity, 
e.g. [61], pp. 89-91. 


30.3. Equations of Motion 


30.3.1. Test Particles and Electromagnetic Waves 


Newton's theory of gravitation or Maxwell’s field equations still give no infor- 
mation of the equations of motion of particles in the corresponding fields. These 
equations must be postulated separately. See for instance the considerations on 
inertial frames as described in (25.1.1/2). Originally it was held that Einstein’s 
field equations must (and can) also be complemented by independent equations 
of motion for test particles. Test particles are particles with a positive mass m 
which move under no forcesin a given 4-dimensional Lorentz-metric space without 
influencing the latter. The stipulation was as follows: a test particle moves along 
a time-like geodesic line. Later on it was realized that, to some degree of neces- 
sity, this requirement follows from Hinstein’s equations. The argumentation, 
being physically plausible, though somewhat macabre from the mathematical 
point of view, may go along the following lines. Let x*(s) be the path of the par- 

k 


d ; x 
ticle, then one sets ve for r*=x*(s) (point of the trajectory), and V*=0 
s 


otherwise. (This is a contravariant vector distribution, as will be dealt with in 
Subsec. 32.3.2.) Here s is the curve length in the sense of Subsec. 29.5.4., so that 
V'V,= —1is satisfied for 2* = x*(s). For p=0 as well as 1 =9 =m one then obtains 
the energy-momentum tensor (30.2.3/1) in the sense of the covariance principle. 
Then (30.2.3/4) gives 


da*\ da? ; 
a ea eee ! 
( ds IL ds ; 


According to Remark 29.4.5/1 these are (time-like) geodesic lines. 


Axiom. (a) Test particles move along a time-like geodesic line. 

(b) Electromagnetic waves propagate along rays. 

(ce) World lines which describe physical processes including a transport of tnfor- 
mation are time-like or null-like at every povnt. 
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Remark 1. Part (a) justifies the above considerations. Rays (=null geodesics) were dealt 


da” da? 


with in 29.5.3. and 29.5.4. Part (c) means that On a=? is valid at every point 


a* =a (z) of the curve. Of course this all happens in a 4-dimensional Lorentz- metric space. 


Remark 2. Part (b) must be interpreted in a large-minded way. Consider a flash of light at. 
the point P in the Minkowskian space; it will propagate along straight lies (=rays) on the 
associated light cone towards infinity. Here the propagation of light is described by the 
Maxwell equations, cf. Subsec. 25.3.2. These solutions show that the Huyghenian property 
is satisfied, cf. Subscc. 19.3.4. In curved spacc-times the Huyghenian property is generally 
lost (cf. Chap. 33, where we shall refer to such problems in detail). A flash of light generated 
at the point P of a curved space-time will generally be recorded at all world points in the 
interior and on the surface of the associated charactcristic conoid. Part (b) of the axiom can 
thus be understood to imply that a light ray propagates along a zigzag path whose sections 
are rays. The (mathematical) cause of this scattering effect lies in the curvature of space 
(Fig. 30.2.). 


Fig. 30.2 


Remark 3. Part (a) of the axiom can (almost) be derived from Einstein’s field equations 
(for, speaking more strictly, from 7’.,—0). The question arises whether this startling trick 
will also work for other equations which, at a first glance, would be regarded as independent. 
If 7’;=0 is applied to the energy tensor 7*! for the Maxwell field (30.1.3/3), then one 
obtains the Maxwell equations for the vacuum, Leet), as described in 30.1.3., as a generic 
result (i.e.,in the general case). For details (in particular with respect to the “generic” 
character) refer to [43], pp. 471-473. Consequently, the Maxwell equations for the vacuum 
can also (almost) be derived from Einstein’s equations. 


Resting is rusting. 


(proverb) 


30.3.2. Proper Time and Twin Paradox 
\ 


Axiom. The proper time of an observer ts equal to the curve length of his time-like 
world line. 


Remark 1. Thus what is considered is an observer (i.e., physical events) with the time-like 
world line z*(s), where s is the curve length as defined in Remark 29.5.4/4. Differences in s, 
that means time intervals, are uniquely determined by this method. Physically, one has the 
same situation as at the beginning of Subsec. 25.1.5.: an observer determines his proper 
tume by a fixed physical procedure (e.g., by an atomic clock which is at rest with respect to 
him, thus always being at the same location as the observer himself). This fixes the timing 
which governs all the inorganic, organic and biological events which happen at rest with 
respect to him (including the aging of the observer). 


Remark 2. The general theory of relativity defines: time = path, while a proverb claims that 
time = money. The conclusion is that path =money. This is paradoxical, for the bulk of the 
money is known to be earned by sitting. 
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Twin paradox. Now the twin paradox of Subsec. 25.2.1. finds a simple solution. 
Two world lines extending from P to Q along the paths 1 and 2, respectively 
(Fig. 30.3), generally have different curve lengths. Hence the corresponding 
proper-time differences are different, too. A pair of twins who are the same age at 
P may thus be of different ages when they mect again at Q. Let twin 1 be a test 
particle: it follows the free interplay of forces, and hence moves along a time-like 
geodesic line of length s, from P to Q. Twin 2 offers resistance, e.g., in the form 
of extended space trips; let its path from P to Q have the length sy. In view of 
Remark 29.5.4/4, s, is extremal. It is easily verified that s, is maximal. Hence 
So <8, (all the considerations are local). Consequently the dynamic twin 2 is 


younger than its inertial brother 1. The popular voice has known that all along: 
resting is rusting. 


Q 


p Fig. 30.3 


30.4. Sehwarzschild’s Solution 
30.4.1. The Birkhoff Theorem 


In A; with the Cartesian co-ordinates x!, x2, x3, and r= V(x!)2-+ (2)2-+ (x3)2, let 
w= {(x!, x2, x3)|r;<r<r,}. Here O<r;<r,=-. A Lorentz metric gei(x*) in the 
4-dimensional space-time wx (—co, ) is called rotation-symmetric if g;(a7) 
depends on r and x‘=¢ alone. 


Theorem (Birkhoff’s theorem). After choosing suitable co-ordinates in w X(—<, oo) 
any rotation-symmetric solution gj, of the Einstein equations in the vacu um, 


ee = (1) 


can be represented as 
2M\-! j z 
dst (dat)? (det (aeye+| (1%) — i| are—(1 =) df (2) 


where M 1s an arbitrary positive constant. 


Remark 1. We use the common notation ds?=g,,da*da!. Tt may be supposed that (1) is 
written in v!, 22, 23, and 2!=¢t. The manifold w X(— 2, ) then consists of one chart provided 
with suitable coordinates x, ..., 24. To make g,; a Lorentz metric, we require that 2M <r;< 
74 =~ (Schwarzschild’s exterior solution), ef. Fig. 30.4. Then 247 las the dimension of a 
length, and is called the Schwarzschild radius. What is remarkable is the fact that kl 
depends on r alone, but not on ¢. Thus one obtains (although this has not been required) a 
stationary solution of (1) (i.e., a solution independent of ¢). 


Remark 2. If the polar coordinates 7, 3, y of Subsec. 12.4.1. are used, then (2) reads as 
follows: 


¢ 
= 


OMNS1 M : 
ds2= (: =) dr? +7232 +72 sin? dp? — (: - =| dt? , (3) 
ep 
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(cf. (12.4.1/2)). This is the usual form. It is simpler, but pretends singularities as @>0 and 
&-—. Consequently a representation in polar co-ordinates requires several charts. Then (3) 
is a local representation for a suitable choice of the x-axis. 


Fig. 30.4 


Interpretation, As 7— 0, ds? in (2) (or (3)) tends towards the metric of the Min- 
kowskian space, to which we had attributed physical reality. Here x!,x2, x3 were 
space co-ordinates and x4 =¢ was the time co-ordinate. Therefore it suggests itself 
to keep this interpretation in the empty Einstein space, 


Neane a) | 2M <1r<co} X(—«, oo) 


with the Lorentz metric (2). Then a comparison with Newton’s mechanics as 
described in Subsec. 30.2.4. gives 


i 

M=m = in the CGS-system . (4) 
Here m is the mass of a particle at the origin, G is the Newtonian constant of 
gravitation, and c is the velocity of light. Hence the interpretation is as follows: 
a particle of mass m located at the origin of R; generates a space-time in {(x!, x2, 
x3) | 2M <r<o}X(—, ), the metric of the space-time being given by (2). 
(x!, x2, 23) are space co-ordinates and 24 is a time co-ordinate (which must not, 
however, be confused with the proper time of particles moving within this space- 
time). For the earth one obtains 2M =8.8 mim, and for the sun, 2M@=3 km. The 
Schwarzschild radius becomes interesting only for high-mass, but relatively 
sinall stars. 


30.4.2. Eddington’s Form of the Schwarzschild Solution 


Let Yt be a metric space in the sense of Def. 29.5.6/1, with the metric g,). If WM 
is an open connected proper subset of Wt, then according to 29.1.2. one may estab- 
lish the induced topology on Wt (Fig. 30.5). By a suitable restriction of the 
associated equivalence class of atlases in the sense of Def. 29.1.2 and by g,)(P) = 
= 9(P) for PEW, WM is made a metric space. The equivalence class of permissible 
atlases for Wt, however, is much larger than the set of those atlases which result 
from the restriction of permissible atlases on Yt to MW. In particular one may 
imagine permissible atlases which become singular as the border AM =IMAIM of 
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Fig. 30.5 


Wt is approached. Hence by suitable atlases (local charts) the origin of 3% from 
We can be completely obliterated. Now it is of interest to invert this questioning: 
given a metric space Yt with the metric g,). The question is whether, in the sense 
stated above, I can be embedded in a larger metric space §)t. The considerations 
show that one must be very cautious in judging this problem. An unsuitably 


chosen atlas in Wt might pretend singularities which in reality do not exist. This 
is exemplified by the following investigations. 


Eddington’s metrie. We consider the Schwarzschild metric (30.4.1/2) in I= 

={(c', 2°, P) | 2M =r<=} (=e, @), (We write Wt instead of M in order to 

avoid confusion with the Schwarzschild radius.) From now on we shall always 

use the form (30.4.1/3), where it is clear how the apparent singularities occurring 

as §--Q and §>7~ must be eliminated. Introducing new local coordinates in M by 
e 


lore Io 3) and et = x4 OAT In ( - i), 


) M 


“ 


one obtains ds?=g',dx*dx’', where 


2 2 
Lop) 0 aE 
r r 
pe 0 Pr 0 0 
ei = 0) 0 r2sin2 & 0 : ») 
2 2M 
Pe 0 <0 =| 
7 r 
2M 2M 
WM t=7 4 .., then g = +——, and if =r _..,., then g'4= = ihe 
i 


apparent singularities which occur as # +0 and )— 7x can again be eliminated by 
changing to the Cartesian co-ordinates x*. It is now easily seen that the Edding- 
ton metric can be extended from St to 


Vex {(x1, x, x3) | Q<r<0}X(—~, oo) 
2 is singularit d that 9, =g', is a solution in I of Einstein’s 
(r=24f is not a singularity), and that g,=g 4 is as 5 
Te 
: Al 
equation for the vacuum iy a je: 


a . a pre. ql . . . . ye B . P 
Interpretation. In the transformation x = x'*(2'), it is only the time co-ordinate 
t which is transformed into ¢’. The space co-ordinates z!, x, x remain wuchanged. 


Hence we can take over the interpretation from Subsec. 30.4.1. and extend it 
from $e to We. 
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30.5. The Classical Effects of the General Theory of Relativity!) 
30.5.1. Planetary Motion 


In Sec. 12.4. we had dealt with the planetary motion as described by Newton’s 
theory of gravitation. We shall now investigate the same problem using Kin- 
stein’s theory. The aspects we are interested in are the differences from Newton’s 
theory which occur in this case. 


The model. Let the sun be fixed at the origin of the R,. According to the inter- 
pretation given in Subsec. 30.4.1. it generates (in the otherwise empty space) a 
space-time metric in W={(z!, x2, 23) |2M<r<eo}, with the Schwarzschild 
metric (2) or (3) stated in Subsec. 30.4.1. Here 2M =3 km. Planets are treated as 
test particles which do not influence this geometry. According to Axiom 30.3.1(a), 
planets thus move along time-like geodesic lines in Yt (with the Schwarzschild 
metric). Here x!, x2, x3 are interpreted as space co-ordinates of the real 3-dimen- 
sional space. Using the curve length s (proper time), one has the following set of 
differential equations to be solved, 


d2r* +{ k } dx! dx” dx* dz! 
a 5 SS i= SoS ’ 
ds? ds ds oi ds ds 

Tf again polar coordinates 7, %, m, and ¢ are introduced as functions of s, then 


the followimg set of equations follows from (1) (without the last equation) and 
(30.4.1/3). 


(1) 


lm 


d dt i 2M 

eee ay oC) aes 2 

i, (” 4 Ovi ial ae (2) 

d ary di ji ide far, d#\2 dy\? 

Cail | 2 ai rei te RE pis) ee oh (a ee ee (ima oe 

il =| eS dr (=) "2 dr (=) (=) ay »(52) o 

d / ,ddé eee de\? 

(ap) -7 0 # 008 0 (54) — 0 (4) 

Oy oe 5 

a ( sin? 3 +] — (i (5) 
If the initial values (12.4.1/6) as well as ¢(0) and ¢(0) are given, Eqs. (2)—(5) 
have a unique solution (at least locally). Here i etc. This is the analogue 

ds 


to Eqs. (3)—(6) of Subsec. 12.4.1. 


Plane orbits and Jaw of areas. From (4), the initial conditions #(0) =< and 

') We quote few sentences from [53, p. 9]: “Special relativity and spccial relativistic 
quantum theory have been checked literally billions of times. But for many years only small 
and poorly measured effects within the solar system indicated that general relativity gave 
better answers than combining its special relativistic and Newtonian limits ad hoc. Today, 
more accurate measurements within the solar system, the tentative success of general 
relativistic models for white dwarf stars and pulsars, the possible discovery of black holes 
and of the gravitational radiation predicted by general relativity, and the tentative success 
of general relativistic cosmology have given general relativity a somewhat firmer empirical 
foundation.” 
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a=” as well as from the uniqueness of the solution, one finds that 
d 
=>5.- Then from (5) it follows that r2 weeh. In other words, Theorem 12.4.2/2 
is also true within Kinstein’s theory. 


Advance of perihelion. As in Subsec. 12.4.1., s=0 shall be chosen to define that 
point of the planetary orbit which is next to the sun (perihelion). As in Subsec. 


12.4.3. we use ¢ instead of s as a parameter, setting r(s) = : —— Lhen (using vine 


last equation in (1)) one obtains o(9) 
d2o M 
Ape oe RB +3Mo?. (6) 


Apart from the term 3402, this is just the differential equation for o(¢) as given 
in Subsec. 12.4.3., which had the solution (12.4.3/1), 
M 
moe ye Oey), O<e=<l1. (7) 


This was an ellipse with the perihelion given by s=0 and ¢(0)=0. For the Mer- 
cury, the planet next to the sun, one has 


M 
Ce we ty i © IOS . 


For the other planets this dimensionless number is even smaller. Thus the rela- 
tivistic correction 3Mo? in (6) turns out to be downright tiny. It suggests itself to 
look for an approximate solution of (6) which has the structure of (7). To this 
end, cos ¢ in (7) is replaced by cos egy, where eshall be a number close to 1. The 
best value of e yields the approximate solution 


M M2 
1+ecos {1-3 Y (8) 
he h? 


of (7). As stipulated above, p=0 is a perihelion. The next perihelion is passed 
3M? 2 
when (1 — 


h? 


o(~)~ 


3 ie: 
=a 7, 1.€., Or gies or (1+ I. Consequently a perihelion 


h? 
o 


oi results. Hence the Kepler-Newton ellipse becomes a 


motion of 1=6x 


(slowly rotating) rosette (Fig. 30.6). The corresponding time-like geodesic line in 
Wis a spiral in the ¢-direction, whose projection yields the rosette. The table be- 
low compares these theoretical data with experimental ones. The figures indicate 
the advance of perihelion in angular second per century. (Mutual perturbing 
effects of the planets have already been eliminated here.) 


Fig. 30.6 
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| Theory | Observation 
ecu 43.03 | 43.1140.45 
Venus 8.6 8.4 +4.8 
Earth 2.8 D.O sel. 
Mars 1.35 


30.5.2. Deflection of Light 


The problem to be discussed now is the behaviour of electromagnetic waves in 
the vicinity of heavy masses. 


Model. Again we choose the same space-time Jé as in Subsec. 30.5.1. with the 
Schwarzschild metric (2) or (3) given in Subsec. 30.4.1. Again let 2M@=3 km 
(that means the heavy mass is the sun). According to Axiom 30.1.3(b) elec- 
tromagnetic waves (with possible scattering effects being neglected) propagate 
along rays (null-geodesics). x!, x2, x? are again interpreted as space co-ordinates 
of a real 3-dimensional space. Then one has to consider the set of differential 
equations (30.5.1/1), where in the last equation 0 must be substituted for —1. A 
transformation to polar co-ordinates leaves the equations (2)—(5) of Subsec. 
30.5.1. unchanged. 


: di 
Deflection of light. With the initial conditions 9(0) =" and = (0) UF one 
: 8 


a 


again obtains #(s)=>. This means that rays of light likewise propagate within 


a fixed plane, which is the x',x2-plane in the present case. With the use of 


dx! dx* 


2) nS 0, the analogue to (30.5.1/6) now reads as follows: 
d2 
a te-3Me?, (1) 
dg? 


1 ; : ; 
where g=— and » have the same meaning as in 30.5.1. The approximate equa- 
3 Men Be 


COS ‘ 
a and hence x!=7 cos p= FR. In- 


tion 


ceeebennr 
~+6=0 has the solution —=6= 
dg? y 
sertion of this zero-order approximation in the right-hand side of (1) gives the 
approximate solution 


Fig. 30.7 
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») 


R 
augle between the two asymptotes is O = 1.75” (angular seconds), ef. Fig. 30.7. 
Consequently, light is deflected by heavy masscs. This effect can be verified 
experimentally by comparing night photograplis df a celestial zone with corre- 
sponding photographs taken duriug a total solar cclipse (Fig. 30.8). The measured 
values lie between 1,28” and 2.71”. The covering of strong radio sources in the 
universe by the sun yields a quantitatively better confirmation of this effect 
(with a deviation of less that 10 °,). 


with the asymptotes r~ R— 


ly|. Let R be the radius of the sun, then the 


reference 
ray 


photoplote Fig. 30.8 


30.5.3. Red Shiit in the Gravitational Field 


Frequency shifts. Consider a 4-dimensional Lorentz-metric space with the 
stationary metric 


€ 


ds?=g,adr*dx'+g,,(dé)2, a=1,2,3 and £=1,2,3, 


where the sum has to be taken over doubly occurring indices (from 1 to 3). 
“Stationary” means that g,,; depends on r=(2!, x?, x3), but not on ¢=.x', Then 
from Subsec. 29.5.3. one may conclude that, if w(x, 4)=0 is a characteristic 
surface, this is also true for w(x, ¢+c)=0 for any real number c: invariance of 
translation in the ¢-direction. But then characteristic conoids, aud hence also 
rays (cf. Subsec. 29.5.3.) are likewise translation-invariant in the ¢-direction. 
According to Axiom 30.3.1(b), rays are world lines of electromagnetic waves. 
x=(r!, x2, x3) is interpreted as a position in a real 3-dimensional space, and ¢ 
as a co-ordinate time. Hence the world line of a non-moving observer xp is a 
straight line passing through x, in the direction parallel to the ¢-axis (Fig. 30.9). 
Then for the proper time 7'y of xy one has d7= —gy,(xo)dé2. Consequently, 
AT), =V¥ —gag(xp) At. An analogous result is obtained for x;. Let the observer xp 
trip an emission of light with N cycles in the co-ordinate time interval Ag. 
The observer x; also records this light as N cycles in At. Then the frequency 
yy at ao, which must of course be referred to the proper time, is given by 


N Ly 
Vn, awe 
0 V~gra(%y) At 
aud an analogous relation holds for », at x;. This gives 


m= M% yee. (1) 


Gas41) 
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In view of |g4,(x1)l>|ga,(xo)|, it follows that a red shift of the spectrum, i.e., a 
frequency reduction, occurs. 


Schwarzschild metrie. The above conditions are satisfied for the Schwarzschild 
9 


2M Me uM. 
metric (30.4.1/3), with g,,(7) = — (1 2) . Considering that in real cases |; 3 


a very small value, one obtains (approximately) 


Ay 1 it d 
sam (7-2) with 2M<ryg<ry<o and Ayv=—», 
Wy a 
as a red shift in the Schwarzschild metric. 


light rays 


Fig. 30.9 Fig. 30.10 


Test (Fig. 30.10). Choosing the radius of the earth to be 7p, and setting r,= 


At 


A : : 
=7)+22.5 m, one obtains —=2.5- 10-15, Ib is past belief, but yet this effect 
v 


0 
ean be verified by experiment. The result, 


Ay 
—=2.5- 10-15 (1.05+0.10) 
a) 


at the saine time yields one of the best quantitative confirmations of the general 
theory of relativity. 


Now, my suspicion is that the universe 
is not only queerer than we suppose, 
but queerer than we can suppose. 

(J. B.S. Haldane, “Possible Worlds’’) 


31. General Theory of Relativity I (Singularities, Black Holes, 
Cosmology) 


31.1. Singular Manifolds 
31.1.1. Criteria 
The modern theory of relativity is characterized by deep-going studies on 


singularities in curved space-times. In Secs. 31.1. and 31.2. we shall try to 
give an impression of these studies. Here we restrict ourselves to a few (as we 
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hope) typical concepts. For details, necessary more precise definitions and a great 
number of other fundamental concepts we refer to [24]. This book has also 
been taken as a basis for the following presentations. Further we refer to 
Chap. 33, which can be considered as a continuation of the present considerations. 
A description of the present state of the theory of relativity has been given in 
[81]. As in Chap. 30, M is always assumed to be a 4-dimensional Lorentz-metric 
space: a (curved) space-time. We ask for mathematical and physical criteria 
according to which such a space-time can be evaluated. 


Definition 1. A space-time M 7s called time-like complete if every time-like geodesic 
and every null geodesic is part of a corresponding geodesic whose geodesic parameter 
runs through all real numbers. M is called singular if M is not time-like complete. 


Remark 1. In view of Theorem 29.4.5, the definition is meaningful. Consequently, the 
question is whether every time-like geodesic and every null geodesic can be so extended that 
its geodesic parameter s runs from —~ to +. Such a question appears very meaningful, 
because s (after being normalized) is the proper time of test particles. If M is complete, then 
a test particle can thus travel over an infinitely long time without anything happening. 


Remark 2. There are several other (in part more restrictive) concepts of completeness, 
which have been developed by Penrose, Hawking and Ellis. We refer to [24]. 


Remark 3. The Minkowskian space is time-like complete. Here the geodesics are identical 
with the straight lines. However, if one removes a point or a closed setw from the Minkow- 
skian space, then the remainder is a singular space-time (Fig. 31.1). 


Minkowskian Space Fig. 31.1 


Problem. The last example illustrates the problem encountered: can a given 
singular space-time M be diffeomorphically embedded into a larger, time-like 
complete space-time M? Speaking more precisely, what is desired is a time-like 
complete space-time M and an open set 2 in M with the following property: 
if, as described at the beginning of Subsec. 30.4.2., @ is transformed into a 
space-time, then there is a diffeomorphism of M onto 2 in the sense of Subsec. 
29.1.2. Generally this is not possible. This compels introducing new concepts. 


Definition 2. A space-time M is called locally inextendible cf there vs no open set w 

of M with the following properties: 

1. @ (the closure of w in M) 1s not compact, ae - 

2. w can be diffeomorphically embedded into a space-time M in such « way that o 
(the closure of the image @ of w in M) is compact. 

M is called locally extendible otherwise. 
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Remark 4. Compact sets have been deseribed in Subsee. 29.1.1. Roughly speaking, the 
definition is as follows: sinee @ is not compact in MZ, w borders on the boundary of M along 
the bold solid line, ef. Fig. 31.2 (this is not a precise formulation). If w can be extended 
beyond this boundary into 47, then @ is eompaet in AM. Then all the boundaries of @ lie in 
the interior of Jf. The definition is in effeet the precise formulation of this illustrative 
deseription. 

Remark 5. If, for example, g(x) =det g,;(%) +0 as ~€@ tends towards the bold line segment 
in Fig. 31.2, then this shows that an extension beyond this part of the boundary is not 
possible. 


ty ” 
+ a 
eee ee ) 


Fig. 31.2 = Fig. 31.3 


Tinie-orientable space-times. As the case may be, we shall place additional, 
physically reasonable conditions on_a space-time as well as on the energy- 
momentum tensors 7*’, Thus the condition that there shall be no time-like 
closed lines is physically reasonable (chronology condition). This simply means 
that a journey into one’s own past shall be impossible. An essential concept 
is that of time orientability: a space-time M is called time-orientable if the 
local light cones (local characteristic conoids) can be subdivided in M in a con- 
sistent and continuous manner into a “future cone’+ and a “past cone” — 
(Fig. 31.3). Locally (within one chart) this is always practicable (see also Subsec. 
33.2.2.), whereas globally, i.e., on the whole space-time M, it is an essential 
additional requirement, which is physically highly reasonable. 


Requirements imposed on T*, The energy-momentum tensor 7 and the 
metric gj; of the space-time M satisfy Einstein’s equations (30.1.2/3). The follow- 
ing (mutually independent) requirements on 7 are physically reasonable. 

(a) (Weak energy condition): For all time-like vectors V,, T”V,V,=0. 

(b) (Time-like convergence condition): For all time-like vectors Vz, R!V;,V1=0. 
(c) (Generic condition): For every time-like or null geodesic, the tensor 
Vir Rupa Vs) V? V4 is different from zero at one point at least of this geodesic. 
V;, is the tangent vector to the geodesic. 


Remark 6. In eaeh partieular ease we shall exaetly indieate what kind of additional eondi- 
tions are imposed on a space-time. 


31.1.2. The Sehwarzschild-Eddington-Kruskal Metrie 


Let us apply the considerations of Subsec. 31.1.1. to the Schwarzschild metric 
(30.4.1/3) in Yt and to the Schwarzschild-Eddington metric (30.4.2/1) in M. 
Mt and Wt are the space-times of Subsec. 30.4.2. If the domain w has the shape 
indicated in Fig. 31.4, then @ is not compact in I, but it is compact in YN. Hence 
Weis locally extendible, and henceit is not complete. Nis the extension of It. The 
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2 9 


- which becomes singular as r-0. On the 


curvature scalar in tis R= 


other hand, the proper time (measured in the CGS-system) of a time-like line 
that starts from a point 0<r<2WM (the counting of the proper time is also started 


at this point) and moves in the positive ¢-direction is at most 1.54 - 1075 i 


; © 
sec. (Mo=1.5 km isthe Schwarzschild radius of the sun). From these two facts 


it follows that It is not time-like complete bnt cannot be extended beyond 
r=0. Nevertheless it turns out that Mt is locally extendible. 


Fig. 31.4 


Kruskal metric. The Kruskal metric has the form 


ds? =e 23f (dz2—dw2) + r2(d2 + sin29d¢?). (1) 


Se = 2 
? ee 


It is obtained from the Schwarzschild metric (30.4.1/3) by substituting z= 
=2(r, t), w=w(r, t), while g and # are left unchanged, where 


t t 
1 ele 
Yo yD =e 2M 3 Z 
Ore a ee =: (2) 


efll + e 1m 


This also explains the terms with r=r(z, w) as well as with p and # in (1). 
r=const are hyperbolas in a z,w-plane (Fig. 31.5), where the uninteresting 
§-g-dependence is omitted. Kruskal’s space-time t* then lies between the 
two branches of the hyperbola w?=2?+ 1. Here the Schwarzschild space-time We 
becomes the region J, i.e., {(w, z) | z>0, lwl<z}, while Eddington’s space-time 
Mt becomes the region I+II, i-e., {(u, 2) | (w, 2)EM*, wt+2z>0}. (Here it is always 
necessary to add # and g, so that 4-dimensional manifolds are obtained.) 
Consequently Jt* is an extension of M. It is not complete (for the above consid- 
erations for r~>0 concerning the Eddington metric are still applicable), but 
it is locally inextendible. 


Light cone. The straight lines z= tw+const are null geodesics in t*. This 
follows from (1) with #=const and p=const. Wt is time-orientable. Let the 
direction of increasing w-values be the futnre direction. To this there corre- 
sponds the future cone drawn in J. The future direction is continued continuously 
to M*. This leads to the following interesting situation (Fig. 31.6): information 
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can be transmitted from I (that is Nt) to II (that is MEM) on time-like or null 
world lines. On the other hand, no information will pass from II to I. In our 
terminology to be introduced later on, II is a black hole. 


Fig. 31.6 


31.1.3. Closed Trapped Surfaces 


Closed trapped surfaces are of fundamental importance in the singularity 
theory of space-times developed by Penrose, Hawking, Ellis and other workers. 


Definition. Let M be a time-orientable space-time. Let F be a compact space-like 
two-dimensional surface without boundary in M. Then F is called a future-directed 
closed trapped surface if each of the future-directed families of rays orthogonal to F 
converges (past-directed closed trapped surfaces are defined analogously). 


Remark 1. Time-orientable space-times have been defined in Subsec. 31.1.1. A surface F 
(which is naturally regarded as a C~-surface) is called space-like if the tangent vectors 
k 
We == of every C~-curve #*(t) that extends within F are space-like, i.e., if gn V* Vi =0. 
Jf the local future light cones are erected at every point of , then the enveloping surfaces 
obtained are two characteristic surfaces (Fig. 31.7). For details cf. Subsecs, 33.1.2. and 
33.1.3. There it is also set out that these two characteristic surfaces are spanned by dis- 
tinguished rays that are orthogonal to F (cf. (1) and (2) in Subsec. 33.1.3.). These are 
the two orthogonal future-directed families of rays mentioned in the definition. What is 
meant by the convergence of such a system of rays is intuitively clear (ef. the concept of 
caustic in Subsec. 33.1.3.). A precise formulation is, however, rather complicated; cf. [24], 
p. 262 and p. 101. A first version might read as follows (Fig. 31.8). Consider an (infinites- 
imal) two-dimensional surface element AF in F. The rays passing through AF (where we 
fix one of the two systems) are described by (33.1.3/2). Tf s>0 is fixed (where s=O corre- 
sponds to the surface /’), then one obtains a surface element (AF)(s). The 4-dimensional 
Lorentz metric g,; induces a (locally Euclidean) 2-dimensional metric on AF and on (AF)(s) 
By means of this metric it is possible to form the surface measure |AF(s)|. Convergence of 


d 
rays now means that re |AF(s)| <0. (Although it is not immediately clear whether such a 


definition of convergence is in agreement with that given in [24], pp. 262/267, it gives an 
at least qualitatively correct picture.) 


(AF I{s) 


AF(O) 
Fig. 31.7 F Fig. 31.8 
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Remark 2. If F is the surface of a 3-dimensional ball in the planc a*=0 in the Minkowskian 
space, then the two characteristic surfaces mentioned above are concs (Fig. 31.9). The 
associated rays (straight lines in this case) converge for the “interior” cone and diverge for 
the “exterior” one. In the Minkowskian space there are no closed trapped surfaces. The 
corresponding picture in the Eddington metric (30.4.2/1) with “‘+” looks different. If one 
chooses for F the surface of a 3-dimensional ball with centre 0 and radius R <2 (Fig.31.10), 
then the discussion in Subsec. 31.2.1. shows that now both of the characteristic surfaces 


mentioned above extend “inwardly”. Both families of rays converge. F is a closed trapped 
surface. 


ir 
Fig. 31.9 


Remark 3. The last example demonstrates the physical meaning of the concept introduced 
above. If light is emitted on /F, this light will propagate into the interior of the ball. The 
light is trapped (trapped surface). 


Fig. 31.10 


31.1.4. Singularities 


Until as late as the sixties, the following opinion was widespread among work- 
ers on relativity theory: the singularities of the known explicit solutions of 
Einstein’s equations are due to the high syinmetries of the corresponding 
metrics (the Schwarzschild-Eddington metric turning King’s cvidence). It was 
believed (and hoped) that general (non-symmetric, realistic) solutions of Ein- 
stein’s equations would avoid such singularities. The clarification of this 
question grew more and more urgent. On the one hand the cosmological conse- 
quences of such singularities were known fora long time: Friedman’s models, big 
bang (cf. Sec. 31.3.). On the other hand the results of modern astrophysics suggest - 
ed the existence of stars with an incredibly high density of mass: white-dwarf 
stars, pulsars, neutron stars (cf. Subsec. 31.2.2.). Spherical stars whose mass 
density is so high that their radius is smaller than the Schwarzschild radius 
appeared, and still appear, quite real from the physical point of view (black 
holes). For this the Eddington-Schwarzschild metric provides an impressive 
theoretical description (cf. Sec. 31.2.). Thus the question whether singularities 
are locally and globally unavoidable for real space-times is not only of mathe- 
matical but also of physical interest. Since 1965 (Penrose) these problems have 
been systematically studied within a qualitative theory. Roughly speaking, 
the result (which is positive or negative depending on the respective point of 
view) reads as follows. Singularities in rcal space-times are not the exception 
(due to symmetries) but the rule. Reference [24] (especially Chapter 8) presents 
a systematic description of these results, which are due to Penrose, Geroch, 
Hawking, and Ellis. Here we quote a typical theorem that shall give an im- 
pression of this theory. 
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Theorem. (Hawking, Penrose, 1970). Let M be a time-orientable space-time 
without time-like closed lines. If T*! is the energy-momentum tensor in the sense 
of Einstein's equations (30.1.2/3), then let the demands (b) (time-like convergence 
condition) and (c) (generic condition) made upon T* at the end of Subsec. 31.1.1. be 
satisfied. If M has a future-directed closed trapped surface, then M is singular 
(in the sense of Def. 31.1.1/1). 


Remark. It is no longer the question of explicit solutions of (30.1.2/3) that is considered here. 
The propositions are purely qualitative. The fact that the existence of closed trapped sur- 
faces leads to singularities appears physically plausible. This can be realized from the 
example of the Schwarzschild-Eddington metric. Since the kind of time orientation is 
irrelevant, the closed trapped surface referred to in the theorem may also be past-directed. 


31.1.5. Black Holes 


A precise definition of a black hole within Penrose’s qualitative theory is 
rather complicated (cf. [24]). Here we content ourselves with a somewhat 
poorly defined presentation, which is however hoped to correctly describe the 
essence of this fundamental concept. Consider a time-orientable manifold M. 
Without loss of generality we assume that M can be represented by a single 
chart: let M be an open subset of R,= Rs x (—~, «). (Otherwise the following 
investigations have to be interpreted locally.) The future direction shall be that 
of increasing ¢-values (¢=.r’). In A,, let 8 be an open set consisting of a finite 
number of tubular domains as indicated in Fig. 31.11. In particular, let, for 


every (€. hy, SM {t=const} be a 3-dimensional compact set consisting of finitely 
many connected components. Let A(t) be such a connected component, and 


let 0B(t) be its 2-dimensional boundary, i.e., @B(t)= B(t)\B() (in Rs, with ¢ 
being fixed). The volume is denoted by |@B(t)|. Now let M be the union of 
FR \S and an open set Sp with Sp cS, that is, the exterior of the tubular domains 
and parts of the tubular domains. The sets 4 or 5, but also 1-3 or 2—3 (Fig. 31.11) 
are called branches of S. 


t 
agit) Fig. 31.11 Btt) Fig. 31.12 


Black hole: A branch A of S is called black hole 7}, for every point PE AN M, 
the future-directed light cone (characteristic conoid) with origin P is also included 
in AM (Fig. 31.12). 


White hole: A branch A of S is called white hole if, for every point PE AN M, the 
past-directed light cone with origin P is also included in AQ M. 


Remark J. The precise definition looks more complicated. It consists of a constructive 
description of A(t). The question for the final stage of a black hole is of special interest. 
What will happen to B(t) ast > (stationary black hole)? 
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Conjecture. The final stage of a black hole 7s described by a Kerr-Newman metric. 


Remark 2. The Kerr-Newman metric contains 3 parameters, which correspond to the mass. 
the angular momentum, and the charge of the black hole. Special cases are the Eddington 
metric (non-moving neutral black hole; angular momentum and charge are zero) and the 
Kerr metric (rotating neutral black hole; charge is zero). These cases will be discussed in 
detail in Sec. 31.2. It appears that the above conjecture is not yet finally proved. But there 
are numerous theorems that point to this direction (naturally on the basis of a precise 
definition of a black hole); cf. [24], p. 331, and [43], p. 876. The surprising fact is that a 
stationary black hole does away with all physical particularities and reduces to three 
numerical measures (mass, angular momentum, charge). The pleasurable fact is that the 
associated space-time is explicitly known (since 1965). 


Remark 3. The physical meaning of the definition of a black hole is clear. Information 
can enter a black hole A M from outside, that is from &,\S, along time-like or null world 
lines, but no information can pass from 4AM to R,\S. We shall study this effect in detail 
later on, see also the considerations on the Kruskal metric set out in Subsec. 31.1.2. 


Remark +. The above conjecture suggests that black holes cannot disappear, whereas they 
can arise in the course of time, or also collide with each other (5 or 1-3 with 2-3 in Fig. 
31.11). The following assertion ean be proved on the basis of Penrose’s theory. 


Theorem (Hawking, 1971). If two black holes B,(t’) and Ba(t’) collide, then a new 
black hole B3(t’’) emerges, and 


|aBa(t’’)|=—l8B,W’)| + leBe’) . (1) 


Remark 5, ¢’ is a time long before the collision (then B,(’) and B,(t’) are almost stationary), 
and ¢” is a time long after the collision (then B,(t”) is almost stationary). 


Remark 6. There arises the question whether black holes are physically real. Black holes 
are, by definition, invisible. But they influence stars and gas clouds in their close neigh- 
bourhood, causing X-ray radiation and the like. On this basis, astrophysics has spotted two 
candidates for black holes, of which Cygnus X1 in the Swan constellation is the better 
known one. The satellites “Uhuru” (that means “freedom” in Swahili), 1970, and “‘Coper- 
nicus”’, 1973, for the measurement of X-radiation, have supported the suspicion; cf. [20], 
p. 134. We also refer to [63] as well as to [58], p. 264, and especially to [45], pp. 85—87. It is 
rumoured that there are workers on relativity theory who assess the situation as follows: 

There are holes either in the heavens 

or in the General Theory of felativity. 


31.2. Theory of Black Holes, Evolution of Stars 
31.2.1. The Eddington Metric 


As was mentioned in Remark 31.1.5/2, a stationary (that means, invariable 
in time) non-moving neutral black hole is described by the Eddington metric 
(30.4.2/1). The term “non-moving” here refers to the starry sky, which is 
regarded as an inertial frame. The space-time 3 of Subsec. 30.4.2. 1s time- 
orientable. We choose the future direction to be that of increasing ¢-values 
(t=2'). To obtain a black hole, we then have to choose the “+” sign in (30.4.2/1), 
men 


5) 2M 5 he ae x 
ds? = (1 =) dr2+4 = drdt — (1 — = dé2 + r2d024 r2sin2 ddg?. (1) 
r o 
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Using the notation of Subsec. 31.1.5., one has S={(x*)|0s7r<2M, t¢€ Ry} 
and Sy= {(r*)|O0<r<2M, t¢ Ry}. To show that Sp is a black hole, we restrict 
ourselves to r vs. ¢ diagrams (i.e. to lines with g=const and #=const). The 
rays ds?=0 are then described by the differential equations dr+dt=0 and 


) . 
dt=dr (2 - ' , which are shown in Figs. 31.13 and 31.14. Then the future- 


- 
directed light cones have the shape indcated in Fig. 31.15. It is seen that Sj 
is a black hole. By choosing the “‘—” sign in (30.4.2/1) we would have obtained a 
white hole. 


Fig. 31.13 Fig. 31.14 


Interpretation. As we stressed on several previous occasions we attribute physi- 
cal reality to the Eddington metric (=Schwarzschild-Eddington metric) as 
describing the gravitational field of a heavy mass (cf. the interpretations given 


Fig. 31.15 


Fig. 31.16 


in Subsecs. 30.4.1. and 30.4.2.). In particular, 7 is the distance of an observer 
from this heavy mass. For r>2M, the (time-like) world line of a non-moving 
observer A is a straight line parallel to the ¢-axis. For 0<r<2M such a world 
line is impossible, for r=2M it can only be realized by electromagnetic waves 
(stationary light). Let us now consider an observer B who approaches the 
black hole on the world line drawn in Fig. 31.16 (a space-crafttrip). As soon 
as he passes beyond P he is hopelessly lost. Within a short proper time he will 
crash against the singularity r=0 (cf. Subsec. 31.1.2.). A and B judge all what 
happens by referring it to their respective proper times (i.e., to the arc lengths 
s of their world lines). For A (with r=const, #=const, y=const) the proper 
time is proportional to ¢. The non-moving observer A, when looking at his 
travelling colleague B, finds that the latter slowly approaches the black hole 
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but (as referred to A’s proper time) will never arrive there. On the other hand, 
is: finds himself, regretfully, bound to disappear in the black hole very soon 
(within a finite period of B’s proper time). 


31.2.2. Stars 

It is believed that nowadays the general life history of a star is known, al- 
though a few details still have to be clarified. Extensive calculations have been 
carried out for that purpose. They have been based on thermodynamics, nuclear, 
and other processes. The models of stars obtained in this way are in good agree- 
ment with the star types observed. According to these studies, the life history 
of a star is as follows. 
Birth. Under the influence of gravitation, a cold gas cloud contracts. It is 
heated up. If the mass m of the gas cloud is less than 10-2 mo (where mo is the 
mass of the sun), then the cloud will not reach the temperature necessary to 
initiate the process H + He (conversion of hydrogen to helium accompanied by 
a liberation of energy). The gas cloud will then disintegrate. 
Lifetime. If m>10-2m,, then (after a transitional phase) a stable radiating 
star emerges. If the star has a very great mass, then there may be other nuclear 
processes occurring in parallel to or after H-- He (the star burns its ash). 


: Mo \? 
This stable phase willlast about 2 1010 (“2 °) years, and comes to an end when 
? 


the nuclear fuel is exhausted. The formula shows that stars of great mass will 
die early. 

Death. If, m<1.2 mo (Chandrasekhar limit), the star will end as a white dwarf 
(and later on presumably as a black one). Such stars have radii from 3x 103 


to 2x 104 km, a density of mass, pw, of about 106 =e, and consist of bare nuclei 
em 


and free electrons (electron gas). If m>1.2 mo, the star will throw off most of 
its mass into the universe, giving rise to the astronomical spectacle of a super- 
nova. What is left turns into a neutron star. Stars of this type have radii that range 


and consist 


between 6 and 100 km, a mass density uw of about 10!8 to 10! 


59 
Jv 


cm 
of neutrons (neutron gas). It is believed that pulsars are neutron stars. The best 
known example of a supernova and an associated pulsar is the Crab nebula 
(this supernova was observed and described by the Chinese in 1054). 
Remark 1. The above picture is believed to be qualitatively and (apart from common 
factors) quantitatively correct. It is supported by numerous theoretical considerations 
and experimental data (especially of radioastronomy). We refer to [43], §§ 24.2 and 24.3, 
[56], Chap. 1, and to [20, 58, 67]. For us those contributions are of interest which can be 
made by the theory of relativity to this question. 
Critical quantities. Let us again consider the Eddington metric. A contracting 
(spherical) star becomes a black hole if its radius r decreases below its Schwarz- 
schild radius r.=2M (Fig. 31.17). In the CGS-system, according to (30.4.1/4) 
one obtains 

G 4 


ewe 3 remy , 


= e 


25 Triebel, Math. Physics 
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where yw is the mass density. If in this formula r is replaced by the mass m= 


4. 
aa ru, then one obtains a critical mass m,. This gives 


1018 1039 
r=re~—— cm and m=m~4- —g, (ly 
(i Vu 
where w has to be inserted in 5. Some typical numerical values have been 
c 


entered in the diagrams of Figs. 31.18 and 31.19. Above the critical curves, the 
star is lost beyond its Schawrzschild radius, becoming a black hole. 


normal stars 


white dwarfs 
neutron stars 


Fig. 31.18 


705-107 10%-10% = (g/cm?) 


Fig. 31.19 
10+ pe (g/cm?) 


3 (density of atmosphere), then the critical mass is m,~105 mo. 


Remark 2. Tf « ~10~4 aa 
For such a low mass density the nuclear conversion H > He is not possible. So the exist- 
ence of huge cold masses forming a black hole is at least theoretically conceivable. 

Remark 8. Reference figures for the sun, ©, and the earth, E, are as follows: rg =7 X 10° km, 


8 . oO | = 8 
Ho=l4 eae Mo =2 X108 g, rg=6 x10% km, and Hp=5.3— 5° 


Oh, Be A Fine Girl, Kiss Me Right Now, Sweet! 
(Crib used to learn the spectral classes) 


31.2.3. The Hertzsprung-Russell Diagram and the Celestial Scale 


Books on modern astrophysics and radio astronomy are more thrilling to read 
than many a thriller [20, 28, 56, 58, 67], for example in comparison with [10]. 
In the preceding subsection we indicated some quantitative data of stars. Other 
information (especially about distances in the universe) is of interest for the 
later cosmological cosiderations. Therefore let us make some remarks on dis- 
stance measurement. 
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Ist step (of the celestial scale). If a star that is not so very far away is observed 
from different points of the earth’s orbit (Fig. 31.20), then its position exhibits 
slight variations relative to the other fixed stars (angle «). As the diameter 
of the earth’s orbit is known, the distance of the star from the earth can be 
calculated from these observations. For many stars this effect is too small to 
be measurable. 


Sf fixed stars 


‘SY star 
rece’ 
i % 


Z \ 
Fig. 31.20 


The Hertzsprung-Russell diagram (Fig. 31.21). On the other hand, however, 
the number of those stars whose distances can be determined by this geometric 
method is so large that it is possible to find a remarkable relationship between 
the spectral classes of the stars and their absolute brightness (multiple of the 
magnitude of the sun in the diagram). The spectral classes (set of all charac- 
teristic lines in the spectrum of light that are assigned to certain chemical 
elements) correspond to surface temperatures. The results is the Hertzsprung- 
Russell diagram, with a principle line and several secondary lines. 


spectral classes 


0. BA Fe 


6 


~ 


brightness 


Fig. 31.21 

1 extremely hot blue stars; 2sun-type stars 
(G stars); 3, 4 red giants; 5 variable stars; 
6 white dwarfs 


Ss 
+ 


an) 3 10°degree 


Variable stars. The geometric method yields, however, yet another remarkable 
result. There are stars whose stellar brightness shows periodic fluctuations (the 
period being a few days). For some of the variable stars (a sufficient number 
of them) the geometric method enables us to measure the distance, and hence 


ightness 
or 


absolute bri 
iS 


0 


eens, 31.22 
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period {in days] 
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to determine the absolute stellar brightness. It turns out that the period in- 
creases in the same measure as the brightness of the star (Fig. 31.22). 


2nd step. It is now supposed that the Hertzsprung-Russell diagram as well as 
the law of brightness periods of the variable stars are applicable to all stars 
(not only to those whose distances can be determined geometrically). But 
this makes it possible to determine the distances of all stars: by determining 
the spectral class one obtains the absolute brightness, and from a comparison 
with the observed brightness one calculates the distance. 


3rd step. In the late twenties, Hubble discovered variable stars in scveral 
galaxies. By means of the method of the second step he was able to determine 
the distances of these stars (and hence those of the galaxies). On the other hand 
the light of (most of) the galaxies exhibits a red shift of the spectral lines. If 
this red shift (decrease in frequency) is interpreted as a Doppler effect, one 
obtains an escape speed of these galaxies from our Milky Way which is pro- 
portional to their distances (cf. 31.3.1). Again it is supposed that this law is 
universally valid. This makes it possible to determine the distance of far galaxies 
from the Milky Way by measuring the red shift. 


Star evolution. The course of life of stars can be traced in the Hertzsprung- 
Russell diagram (Fig. 31.23; according to [86]). The first diagram shows the 
evolution of a star of the type of our sun: origin 1, principal line 2, red giant 3, 
ejection of matter 4, white dwarf 5. The second diagram characterizes the evo- 
lution of a massive star (m=>3mq): origin 1, principle line 2, red giant and 
supernova 3, neutron star 4. 


Fig. 31.23 


31.2.4. The Kerr Metric 


As mentioned in Remark 31.1.5/2, a stationary (i.c. invariable in time) rotat- 
ing neutral black hole is described by a Kerr metric. Here the term “rotating 
refers to the starry sky, which is considered to be an inertial frame. The angular 
momeutum, too, is measured by a non-moving observer far away from the 
black hole. The Kerr metric is a solution of Einstein’s equations in empty space, 


1 
os, legge =), having the form 


, for... wh i 
ds? =? es dd? } + (72 4+- a”) sin? Idgy? — de + —— (asin? bdy —dé)2, (1) 
; 2 
@=o(7, O)=72 a2 cos 7, ASA =F 2M as (2) 


Here M and a are constants, 0<r<o, 0<8<nx, and 0=y<2z. For a=0 one 


obtains the Schwarzschild metric (30.4.1/3). This suggests setting M =m s , as 
Gs 
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was done previously in (30.4.1/4), where m is the mass of the black hole. Further 
it turns out that Ma is proportional to the angular momentum, the body rotat- 
ing about the axis 9=0 or =n. From (2) it follows that (1) becomes singular 
for J(r)=0. This gives 

ry=M+VM2-a2 and r_=M-yM2_-22. (3) 
The ease M <a (Fig. 31.24). For AM <a there occur no singularities A(r)=0. By a 
transformation of r, O, ¢, t into x, y, z, Lit is possible to obtain the form 

ds?=dz2+dy?4+dz2—di2 


2Mrs— [r(vdx+ ydy)—a(ady—ydx) 2zdz _.\2 (4) 
ee, 9 > 5 + di 
pas Re eye r 
rity? 2 y 
Here x, y, z and r are related by 2p +—>=1. One thus obtains confocal 
a2+4r2 0 72 


ellipsoids in Rs, which contract to the circular disk x2+ y2=«2in the plane z=0 
as r-Q. (4) tends towards the Minkowski metric as r—<o. This suggests to 
interpret (x, y,z) as real space co-ordinates. (4) is the analogue to (30.4.1/2) 
and describes the gravitational field of a heavy mass rotating about the z-axis 
(this mass is not a black hole). 


singularity 
ot 
ee: 
ar ergosphere 
Fig. 31.24 Fig. 31.25 


The case M>a (Fig. 31.25). In the hope of being able to describe the gravita- 
tional field of a rotating black hole, one will first ask for a reformulation of (1) 
that yields the Eddington metric for a=0. The corresponding metric reads 


2Mr M 2Mr : 
ds2= — (1 — ad dt? ste didr + (1 +-—; Jave 
0 0? oe 
» 
— 2a sin? 8 (1 zat r) drdy —4 “ Mr sin2dqdt (5) 
0 0 


de es [(r2+@2)2—a?A sin? @]sin2ddp? + 07d? , 


ie 


b 


where 0 and 4 have the same meaning as in (2), and 0<r<«, 0<#<n, whereas 
g has a different meaning. For a=0 one obtains (31.2.1/1). Letting 7-- co one 
approximately obtains the Minkowski metric again (slightly perturbed by ay 
so that for large values of r this quantity can be regarded as the distance from 
the rotating body. As r—7r,, distortions similar to those in the case M <a are 
found to occur. It turns out that B(t)={r|r<r,} is a stationary black hole. 
r=r_ has no special meaning. In contrast to the Eddington solution (i.¢., 4 =O), 
for a0 one obtains a circular singularity (analogous to the case M <a) instead 
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of a point singularity. An event at the point C will be found to appear in the 
circle © close to C after a short period (Fig. 31.26): a stationary behaviour is 
not possible, one cannot escape from B(t). An observer A will indeed detect 
the rotation, but he can remain stationary. There is an interesting intermediate 
zone, the so-called ergosphere. It is bounded by r=r, on the inside and by 
r= M+YM2~—a? cos? } on the outside. An observer B cannot remain stationary, 
but he can escape from this dangerous zone to the outside: violent rotation with 
the possibility to escape. According to Subsec. 31.1.5., |@B(é)| is of interest. 
One obtains 


lOB()| =8xM(M +) M2—a?2). (6) 


Singularity 


& © 


os 
rotation Fig. 31.26 


Supply of energy. Reference [43], p. 906 and p. 908, tells the following story. 
An advanced civilization sends waste containers on a Penrose trajectory (see 
also Subsec. 31.2.5.) to a region near a rotating black hole (into the ergosphere). 
At a suitable point the waste is dumped into the black hole. The enormous 
rotation of the black hole causes the empty container to rush back to its start- 
ing point with a formidable speed. There its kinetic energy is utilized for, 
say, electric power generation. The final result is that not only the waste is 
converted into energy, but an additional amount of energy is tapped from the 
black hole. 


31.2.5. Energy Balance of Black Holes 


What happens when two black holes collide, or when a particle falls into a 
black hole? 


Collision. Suppose that the collision of two neutral black holes (represented by 
Kerr metrics with the parameters M,, a, and My, ay) again produces a black 
hole (Kerr metric with the parameters M3, a3). This corresponds with Conjec- 
ture 31.1.5 as well as with Theorem 31.1.5. Then it follows from (31.1.5/1) and 
(31.2.4/6) that 

M(M; + ¥M5—a3) > M y(M 1+ VM} —at) + Mo(My+V M3 —a}) . (1) 


Here it is quite possible that M,+M.—M,>0. According to Kinstein’s con- 
ceptions referred to in Subsec. 25.2.5., this mass defect is converted into enercy. 
An extreme case is that M,= M )=a,=a, and a,=0. Then it follows from (1) 
that M;>M,. Hence it is possible that M,+ M,—M,~ M,. 


Particles. According to Penrose, a particle that falls into a black hole B,(t) 
leaves another, new black hole B,(t’), where |@B,(¢t’)| =|éB,(é)|. We again suppose 
that the two black holes are described by Kerr metrics with the parameters M,, 
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ayand Mp), ay. Then it follows from (31.2.4/6) that 
M,(M,+VM3—«2)>M \(M,+VM?—a). (2) 


If the rotation is decelerated during this process, that is, if @y> ay, then by (2) 
it is possible that M,> M,. The mass defect M,— My is converted into energy. 
This result fornis the basis for the considerations on energy supply as described 
at the end of Subsec. 31.2.4. 


One thing I have learned in a long life: 
that all our science, measured against 
reality. 1s primitive and childlike — and 
yet it is the most precious thing we have. 
(A. Einstein) 


31.3. Cosmology 
31.3.1. Principles 


From the very beginning, the general theory of relativity has claimed to be 
able to give information of the universe as a whole. Detached about local 
particularities (such as stars, galaxies, black holes ete.), one looks for global 
solutions of Einstein’s equations, 


1 
ey hg ig! —.r" (1) 


making reasonable assumptions on the energy-momentum tensor 7. It is 
demanded that the cosmological principle is valid: the universe, when viewed 
from whichever of its points one may choose for observation, looks (globally) 
identical; there are no distinguished directions. Astronomical data furnish 
approximate support to this principle: galaxies, even clusters of galaxies, radio 
sources, and the so-called 3K-background radation exhibit a largely isotropic 
(non-directional) distribution. 


Data. There are only few data of interest for cosmological studies. Besides the 
3K-background radiation, which shall be dealt with later on, these data include 


the following items. 8 
1. The mean cosmic mass density o. Its estimated value is 1073! —-~=so0s 
em? 
19-29 8 


cm? 
2. The frequencies » of the spectral lines of the light of far galaxies (and of 
objects of radioastronomy) are shifted by A» to lower values (red shift). This 


A - 
phenomenon is described by Hubble’s law, = —Hr, where H is Hubble’s 


constant and r is the distance of the object from the earth (cf. Subsec. 31.2.3.). 
The present value is about H~!~ 102% cm ~ 10" light years (in [43], p. Tae. 
the value of H~!=18x 10" Ly. is indicated). 

3. For the age of the world, the following data are available. Investigations of 
terrestrial rocks on the basis of the half-times of radioactive elements yield 
a maximum age of 4.5109 years. Star models yield age data of at most 
3X 101° years. 
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Remark. Every model of the universe has to take account of these data (at least in terms 
of their order of magnitude). Conversely, when models of the universe are investigated 
whieh contain free parameters (and this will be the case), then these data can be used to 
determine the parameters in question. 


Set-up for T*!. We make the set-up (30.2.3/1) for 7, that is, T= (u+p)ViV'+ 

+pg", where, following the cosmological principle, we assume that p= x(t) 
and p = p(t) depend only on the time =~‘, but not on the position (a!, x2, 23). One 
then obtains the system (1)—(4) of Subsec. 30.2.3. For the streamlines V* of 
Subsec. 30.2.3. we introduce 3 parameters: dilatation V*.,, vorticity Viz.) and 


shear o,;= V (is) 3 Ve 


31.3.2. The Robertson-Walker Metric 


Theorem. A non-vortical (irrotational) and shear-free model of the universe can be 
represented in suitable local co-ordinates by a Robertson-Walker metric ds? = 
=S?(t)do?—dé2. Here do? is a positive definite three-dimensional line element 
whose associated curvature scalar ts constant.» 


Remark 1. What is desired is a solution of (31.3.1/1), where 7’*! satisfies the set-up of 
Subsee. 31.3.1. and Viz, =0%;=0. 


Remark 2. One has do?=y,,dx*da?, where the summation goes over « and f from 1 to 3. 
Here the fundamental tensor Yap 18 positive definite, i.e., its signature in the sense of Subsec. 
29.5.1. is 3. [f e is the curvature scalar associated to Y2g in the sense of Def. 29.5.7, then in 
the above theorem we can assume that e=1! or ¢=0 or e= —1. If this is not the case, then 
we modify S(f) in the theorem by a positive constant factor. 


31.3.3. The Dust Universe 


We have to determine S(t) of Theorem 31.3.2. That is, we ask for a cosmolo- 
gical model, i.e., a solution of (31.3.1/1), which is described by a Robertson- 
Walker metric and whose energy-momentum tensor 7! satisfies the set-up 
of Subsec. 31.3.1. For a quantitative discussion of the empirical data concerning 
the energy density « and the pressure p we refer to [43], pp. 711-713. The 
results presented there suggest the following set-up, which is called the dust 
universe: in (30.2.3/1), let »(¢)=0 and p(t)=o(t), where o(t) is the mean cosmic 
mass density at the time ¢.” 


Theorem. Jn the dust universe, 


: mS3(é) o(t)= M , (1) 
Some it go! 5 ‘ 
SO =F sa t S%)—e. (2) 


Remark 1. (1) and (2) can be derived from (2)—(4) in Subsee. 30.2.3. and from the conven- 
tions made above. Here x is the Einstein constant, A is the cosmological constant, and 


') Sec also [53, pp. 179/180}. 
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' : ds 
é=—1, 0, 1 in the sense of Remark 31.3.2/2. Further, S(f) = Th The normalization of ¢ 
di 


suggests that do? m Theorem 31.3.2 be regarded as dimensionless in the CGS-system. 
Then S(t) has the dimension of a length (like dé?, in the sense of the convention that time = 


path). Hence 3 mS%(t) is the volume of a (Euclidean) ball. S(t) is called the radius of the 


universe. (1) then means that the total mass J of the universe is independent of time (law 
of conservation of mass). WV is an integration constant that can be fixed arbitrarily (from 
the mathematical point of view). 


Remark 2. (2) is the famous Friedman equation. The case 4 =0 is of special interest (last 


but not least for aesthctical reasons). Einstein, in his later years (after Hubble’s discoveries) 
constantly pleaded that physical significance be only attributed to this case. 


31.3.4. Hubble’s Law 


From now on we shall set 4=0 in (31.3.3/2). Our aim is as follows. From the 
present knowledge of the mean cosmic mass density @ and the Hubble constant 
H (ef. Subsec. 31.3.1.) we want to determine the still unknown parameters « 
(with the possible values — 1, 0, 1) and M of Theorem 31.3.3. Thereafter it will 
be possible to solve (31.3.3/2) (with 4=0). Then the age of the universe can be 
determined from the explicit formula for S(é), where of course it is hoped that 
the theoretical value is in agreement with the empirical value indicated in 
Pubeec, 31.3.1. 


Hubble’s constant. The light cones in a Robertson-Walker metric are deter- 
mined from ds?=0, i.e., dé2=S2(¢)do2. From the isotropy of space, but also 
from the explicit line elements do? for each of the 3 cases e= =i, 0, il, ie ie 
observed that these light cones are translation-invariant in the o-direction. 
If one considers two observers 1 and 2 (Fig. 31.27), then for infinitesimal condi- 
tions (on a cosmical scale this means distances like those between neighbouring - 
galaxies) one obtains a =S(é,)d and b=S(t.)d. The proper time of the two non- 
moving observers is ¢. If 1 emits light of the frequency 7», then 2 receives this 
light with the frequency v., where ar =br» (=number of oscillations in the 
proper time intervals « and 5, respectively). With Ar=v)—1 and y=», this 


gives aad —1. For ¢,-é and f=f one obtains 

a vp S(t) i. 
Ay S(t) . _ S(t) : 
ee a0) r, and hence OS Gy : (1) 


Here we have made use of At=t) —t,=r (distance between the two observers). 


eae, 
i 
| 


| 
Fig. 31.28 
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Friedman’s differential equation. From Theorem 31.3.3 (with A=0) it follows 
immediately that 


—=—, 2 

530 | a 
This gives rise to a chain of conclusions that appears fantastic. If H(é) and o(é) 
are known at some time ¢=¢, (our present time), then the quantities « (that is, 
e= —I, 0, 1) and S(t) can be determined from (2). With V1 one ts. 8) ivan 
then calculates the total mass M. Thereafter (33.3.3/2) is solved for A=O. 


From the explicit solutions, it will then be possible (as we shall see) to determine 
io: the age of the universe. 


Dilatation (Fig. 31.28). For the considerations to follow, the following remark 
will be useful. The distance between two observers in the Robertson-Walker 
metric is determined by ds2=§2(é)do?. Consequently, with respect to the time- 
independent standard distance o we have s=S(t)o. As S(é) increases, s increases 
too. Thus the wavelength of the waves existing in the universe will increase in 
the course of time. This results in a red shift of the frequencies, which in its 
turn is described by (1). 


31.3.5. Solutions of Friedman’s Equation 


Theorem. The Friedman equation (31.3.3/2) with A=0 has the following solutions. 


CS ee als joe =O (1) 
wy 6 

s@)=Gm(* 5-1}, =Gm (22 for e=—1, (2) 

S(r)=GM(1—cos 1), t=GM(r—sinr) for e=1 (3) 


Here G denotes Newton’s constant of gravitation. 


Remark 1, These are special solutions. The general solution is obtained if one replaces ¢ 
by —é and by ¢+ const. In any case, !=t(t) is a monotonic function. Hence t=7(t) exists. 
Then the solution of (31.3.3/2) is S(z(#)). 


Remark 2. Tn all cases one has S(0)=0. For +20, the functions S(t) in (1) and (2) are 
increasing functions of t (and hence also of t). Hence the radius of the universe, S(x(t)), 
increases monotonically from 0 to «: the universe expands, (3) is the parametric represen- 
tation of a eycloid; cf. Subsec. 11.2.2. One obtains a pulsating universe. Now it is also 
clear how the age of the universe, 7’, can be determined: if H(7’) and 0(7) are known, then ¢ 
and S(7') are calculated from (31.3.4/2). This also determines 7’, where there are two 
possibilities in the ease e=1. Since Hubble’s law corresponds to an expanding universe, 
for e=1 one has to choose the smaller of the two possible 7 values (Fig. 31.29) 


Fig. 31.29 
gap ZIEGM ” 
Remark 8. 2500 years ago the greck philosopher Heraclides remarked: This world has 


been created neither by a god nor by a man. It was and will be a living fire in eternity, 
growing and ceasing in a temperate manner. 
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31.3.6. Friedman’s Models 


Hr : 2 
Critical mass density. If e=0, then In a na In¢, and hence, according to 
9 


(13:4. H()=s. With H-1(7)=18x 10" years one obtains 7’=12 x 109 


years for the age of the universe. To our present knowledge this is quite a discus- 
sible value. If the associated mass density ts calculated from (31.3.4/2), one 


obtains e,=6 x 10~30 — If g=0,, then it follows from (31.3.4/2) that e=1 


(pulsating universe). For o<g, one has e = — 1 (expanding universe). Finally, in 
all cases oue can determine the radius of the universe, S(7'). The present experi- 
mental value indicated in Subsec. 31.3.1. for the mean cosmic mass density 
does not yet permit a final conclusion as to which of the three cases e= —1, 0, 1 
is the true one. 


Line element dg2. The line element do2 of Theorem 31.3.2 has the form 
do? = dr? + f2(r) (dd? + sin? 8d¢?) 
where 
payer for 2— 1; 
J@)=" for e¢=0 and 
=e —e ")) for 2e=—1. 


je=5 


For ¢=0 this is the Euclidean line element in polar co-ordinate representation 
Q<r<o, 0O<#=<7, 0S 9<2n. For e=1, 


ry 


do? = dr? + sin? r d824+ sin? r sin? 6 dy? 
with O<r<zx, 0<8<n, and 0=9 <2z, is the surface element of the unit sphere 
in the 4-dimensional Euclidean space. S2(t) do? then is the surface element of a 
sphere of radius S(¢) in R,. Thus the pulsating universe can be imagined as 
the surface of a pulsating sphere in A,. 


Our universe. The following data are thought to be realistic (with certain fac- 
tors of uncertainty): radius of the universe 18 10° light years, age of the 
universe 11109 years. Today the red shift discovered by Hubble has been 
widely accepted as an indication of an expanding universe. 


31.3.7. The Big Bang 


The cosmological models described above begin at the time 0, and at this 
time the radius of the universe is S(0)=0. The universe springs into existence 
with a big bang: a huge fireball in an uproar of furious thermonuclear processes. 
The question is whether such an assumption can be supported by experiment 
today. Model calculations suggest that in this initial stage a relatively large 
quantity of hydrogen must have becn converted into heavier elements, especially 
‘nto helium. Further it can be assumed that a black body radiation developed, 

c'y 


which obeys Planck’s law, u(v, 7) =cv? C= 1)-!. Here » is the frequency and 
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u(y, 7’) is the energy density. ¢ and c’ are positive constants and T is a para- 
meter, the radiation temperature. In fact a relative abundance of helium has 
been detected in the universe. What is much more amazing, however, is the 
prediction and the discovery of the 3K background radiation (K =kelvin scale 
of absolute temperature). In the nascent state the radiation’ temperature 7’ 
must have been very high. According to the considerations on dilatation as 
described at the end of Subsec. 31.3.4., the expansion of the universe involves 
a large red shift, which is expressed by a decrease of radiation temperature in 
Planck’s law. It has been predicted that the present value of the radiation 
temperature should be about 3 K. In 1965, Penzias and Wilson indeed detected 
this 3 K background radiation (the precise value is 2.7 K). In correspondence 
with the cosmological principle it is isotropic (independent of direction). This 
discovery was rewarded by the 1978 Nobel Prize for physics. It is probably 
the most impressive experimental support of the theory of the big bang. For 
descriptions of this theory, cf. [58, 70]. 


We may indeed be very low in the cosmical 
intelligence scale, but it is a measure of 
our scientific advance during the last few 
centuries that we have come to realize 

‘ how unimportant we really are. 
(P. Moore, [44, p. 223) 


31.3.8. Birth of Life in the Universe 


When we ask for the birth of life on the earth, beginning with the big bang, we can 
distinguish three stages. 
1. Formation of the chemical clements. 
2. Formation of those biochemical molecules which constitute the structural elements of 
life: polysaccharides (cellulose, starch), amino acids, chlorophyll ete. 
3. The evolution of life from the most primitive forms to the flora and fauna of our time. 

Today there is no one who seriously doubts that the first of the above stages is a cosmical 
matter, while the third stage is a terrestrial one. Modern astrophysics has detected all the 
stable chemical elements in the universe, although in very different quantities; cf. (28%. 
p- 169]. Here it turns out that, besides the predominant elements hydrogen and helium, 
there are also carbon, nitrogen and oxygen in relative abundance. It can be imagined that 
the majority of the elements are formed within those processes which have been described 
in Subsecs. 31.2.2. and 31.2.3.: the life and the death of stars. As regards the second stage, 
there has been a remarkable reorientation of thinking in the past few years. The fantastic 
successes of radioastronomy and infrared astronomy of recent years have made it appear 
at least highly probable that the structural elements of life (or at least the component 
parts of these elements) are present in relative abundance in the universe. Books on astronomy 
published in the past few years give clear evidence of this: [Ae mec. 27), (20, pp, 5-37), 
[67, p. 165], and [80, p. 97, pp. 109-110]. The probably most impressive report on this 
problem, however, has been given in [28]. In what follows we shall make, in an outline form, 
at least some passing remarks on this subject, largely following the lines of the presentation 
given in [28]. 


Life on the earth. The oldest finds that contain traces of life on the earth have an age of 
3.1 X10° years. The age of the earth is 4.5 «109 years. Thus it can be assumed that primi- 
tive forms of life developed on the earth very early, 
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Molecules in gas clouds. Since the late sixties, a multitude of inorganic and organie mole- 
cules have been detected in interstellar gas clouds. A long list is given in [28, pp. 180/181]. 
Water as well as a great number of carbon compounds, including alcohol, are listed there. 
Here the Orion nebula is a most productive object. On the other hand it is known that the 
Orion nebula is a region with extremely young stars. That means, formation of stars 1s 
observed in a nebula that contains a multitude of organic moleeules. We also refer to 
f£5, pp. 102-105}. It is assumed that about 4109 to 5109 years ago the sun and the 
earth were in a similar situation. 

Dust. Today it is known that in the universe there exist dark nebulae of large extensions, 
which absorb the light emitted by stars. From this it is eoncluded that the size of the parti- 
eles in question must be 10-4cm: particles of metals, graphite and rocks. It is certamly one 
of the most remarkable successes of infrared astronomy that complex organic molecules 
have also been detected in these dust clouds: polysaccharides (such as cellulose) and strue- 
tural elements for chlorophyll. Explanations of how it is possible that such complex mole- 
cules exist in the universe have been given in Secs. 9 and 10 of reference [28]. . 


Birth of life. According to [28], life on the earth developed as follows. Shortly after the 
birth of the sun, it was surrounded by gas and dust, where (at greater distanees from the 
sun), a fairly great number of complex organic molecules were present (analogously to the 
Orion nebula and the neighbouring dust nebulae). The earth, which is formed only a short 
time later, contains no eomplex molecules, because the latter were destroyed in the region 
near the sun. Subsequently, however, such molecules are carried along from greater dis- 
tances in the interior of meteors which land on the earth. Under the terrestrial conditions. 
which are favourable at that time, finally primitive life develops from these molecules. 


Comets and meteors. To support the above theory, the light of comets has been analyzed 
in the past few years. It could not be expected that complex organic molecules would be 
detected, because they are destroyed in the neighbourhood of the sun (provided they are 
not protected from the influences of sunlight). Organic decomposition products of such 
desirable molecules have, however. been detected; [28, p. 90]. The results of the investi- 
gation of meteorites are as follows. About 3% of all meteorites belong to the class of 
“carbonaceous chondrites” (meteorites of grain structure, containing carbon). Their ages 
range between 4.5 x 10° and 4.7 x 10° years, and at least some of them do not belong to the 
solar system. In some of these meteorites, amino acids as well as cellulose-like molecules 
have been detected. These results support the above theory in a very impressive way. 


Planetary systems. Since the biochemical structural elements of life are widespread in the 
universe, it can be assumed that life has evolved at other places too, provided that the 
conditions were suitable, maybe similar to terrestrial conditions, The theory of the forma- 
tion of our planetary system, which is generally accepted today, suggests that planetary 
systems are abundantly present in the universe [28, Sec. 13]. This assertion is also support- 
ed by observations. So far a planetary system of another star has not yet been observed 
directly. But the existence of such a system should induce the star in question to perform a 
eertain periodic trembling motion. And such tremblings have in fact been observed. Among 
our neighbouring stars it is already the nearest but two, Barnard’s star at a distance of 
5.88 light years, that should have at least two planets according to this interpretation 
[45, pp. 87-88]. In [28, p. 137], another five stars at distances between 8 and 27 light years 
are listed, which have planetary systems. 

Habitable planets. According to [28], the estimated average distanee between two planets 
in our galaxy that are host to higher civilizations at a given time is at least 200 light years. 
Mutual visits are banished into the realm of fantasy (Star War will not take place), whereas 
a certain chance, if a minute one, is attributed to the possibility of receiving signals of 


extraterrestrial civilizations. 


Remark. F. Hoyle and N.C. Wickramasinghe continued their investigations from [28] 
in (29, 30]. We quote few sentences from [29]: “So it came about that the more we read 
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and the more we probed many and diverse arguments, the more surely we were pressed to 
the strange conclusion that it was in the comets where we must seek for the early develop- 
ment of life... we find that the case for a cometary origin of life, as well as for a continuing 
infall onto the Earth of bacteria and viruses, is reasonably established . . .”’. This book is a 
painstaking report about indirect proofs under the assumption that the above hypothesis 
is true. However the authors have no direct proofs (at least at the moment when the book 
was written). 


32. Geometry on Manifolds II (Forms) 


aoe. Tensors and Differential Forms 


32.1.1. The Vectors an and da*. Tensor Products 


We refer to the considerations described in Chap. 29. In Subsec. 29.2.2. we had 
introduced tensors (tensor fields, to be more precise) as local geometric objects 
with the transformation behaviour expressed by (29.2.2/1) (w=0). Now we seek 
to establish a more compact notation, dropping the representation in components 
and interpreting tensors as multilinear forms. 


Basis vectors. Let Jf be an n-dimensional orientable C*-manifold in the sense of 
Def. 29.1.2, and let U be a local chart of an associated atlas. With respect to the 
Cartesian co-ordinates «* in U, we form the vector é with the components &, =1 
for a=l and €,=0 for a+l. Here / is a given number, and a runs from 1 to n. 
(Following our previous convention, we speak of vectors and tensors instead of 
vector fields and tensor fields.) If, using the covariant transformation law 
(29.2.2/1), this vector is extended to all the local charts of the atlas, then a co- 


variant vector is obtained, which shall be denoted by = from now on. The 
corresponding contravariant extension gives a contravariant vector, denoted by 
dz'. dz' has nothing to do with an infinitesimal quantity. The notations = and 
dx’ ave, however, very suggestive because they transform according to the usual 
rules (chain oe total differential), Ba and dz’ are called basis vectors. 

Basis tensors. Agee om denotes the covariant extension, in the sense of 
(29.2.2/1), of the tensor é with €,.,=1fora=kto b=l, and £, ,=0 otherwise. 


Here k, ..., / are any fixed numbers between 1 and n. The contravariant exten- 
sion is denoted by dxr*@...@dzx'. Of course one may also form corresponding 


32.1, Tensors and Differential Forms 371 


: ; a 
mixed-variant extensions, for instance dx*@...@ 


ant These tensors (more pre- 
cisely: tensor fields) arc called basis tensors. i 


oe Gi interpretation). lf T,7is ® covariant tensor in the sense of Theorem 
lemons TELNET 
? 


GS ands... @dx 


is a scalar (Theorem 29,3.1/1(d)). In the sequel we shall call the scalar 7, with 
the representation by basis tensors as given above, a covariant tensor (of the 
corresponding rank). This is a co-ordinate-free notation. Changing the co-ordi- 
nates from x* to a” gives 
T=T, dz @...@dzr'=T;, dx*®...@dr". 

If one inserts the transformations for dz’, one automatically obtains the correct 
transformations (in the sense of Theorem 29.2.2). So it is possible to define ten- 
sors in this way too. Analogous interpretations shall be applied to contravariant 


a mixed-variant tensors, where one has to substitute, as applicable, = for 

oa 

Definition. (a) Jf 
T=T, ,dx*@...@dzx> and S=S8,.,dz*®...@dx° 

are tensors of equal rank, and vf Aand ware real numbers, en 
AT + pS=(AT, 5 +uSq_5) da*®...@dz’ . 

(b) (Tensor product). If S and T are any two covariant tensors, then 

Sorts, Tf. ar @...@dr’Qdr'@...@dx. 


Remark. This is the present variant of Theorem 29.3.1/1. In particular, AT? +S and 
S@7T are again] covariant tensors. It is clear how the definition for contravariant and 
mixed-variant tensors has to be modified. 


32.1.2. The Alternating Product and the Exterior Product 


If G, is the permutation group of Subsec. 28.6.2., then (— 1)’, for p¢G;, has 
the meaning indicated there. In all of our considerations we again refer to an 
n-dimensional C*-manifold. 


Definition. (a) (Alternating product). If T 1s a covariant tensor of type (0, k), then 


1 
Alt T=— D (-1) Toa... noe ..Odz . (1) 
kl PES, al 
(b) A covariant tensor of type (0, k) ts called a k-form if T= Alt T. 
(e) (Exterior product, or wedge product). If T is a k-form, and if S is an l-form, 
then 
k+l)! 
Trgae 2 alt (1798). 
kM! 
Remark 1. (1) is the covariant version of (29.3.1/2). Alt 7’ is again a tensor of Type (0, 4), 
called the alternating part of 7’. 


Remark 2. We make the convention that scalars shall also be called 0-forms; 1-forms are 
covariant vectors. If S=A is a 0-form, then we set TAS=SAT=AT. 
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Lemma. (a) 7 as a k-form af and only if Tya,...n) =(— 1)? Va.n for all pe Sy. 

(b) Jf k>n, then there is no non-trivial k-form. 

(¢) /f OSk=n, then at every point of an n-dimensional C”-manifold there exist 
n 
I 
Remark 3. If 7 is a k-form, then it follows from (a) that 7'g..5..5..¢=0 (two equal indices, 
no summation), This proves (b). Further, in the case (c) it is noted that every k-form can, 


exactly ( linearly independent k-forms. 


at every point of the manifold, be obtained as a linear combination of the ( ‘) special &-forms 


> (-1)? dz?@@ ... @dz?® with tsa<... <bsn. 
pe Sp 


MOTO Genet) (en err (2) 


Theorem. (a) Jf T ts a k-form, and tf S ts an l-form, then TAS is a (k+1)-form. 
(b) Let Toy be any k,-forms. Then 


Pay A(Leay Tey) =(Tayd Ta) A Tey = Pay Ley Te) 
and (tf ky =k) 
PayA( Peay t+ Tr) = TayA Tey + Tay Te - 


Reduced representation. In particular one has dx Adz! = —de'Adx*. But this 
shows that every k-form can be written as 
T=k1 SY %,,dx* A... Adz’. (2) 


lsa<...<b3Sn 


This is the reduced representation, because the summation is not, as prescribed 
by the sum convention, taken over the full range of the indices. The missing 
coefficients are calculated from Tyiq,...2)=(—1)?Ly..»- In particular, dxr*A... dx? 
(k factors, with 1=a<...<b=7n) is a (linearly independent) basis in the space of 
k-forms. 


Remark 4. If one admits the summation in (2) to be taken over the full range of a, ..., 6 
from 1 to , then the factor k! does not occur. Further when changing from the local co- 
ordinates z* to the local co-ordinates 2”! one obtains 


Ox” Ox? 
SUES ICAI Saree eal (= da’) aes (jaa d't) F (3) 


oa’ © 


which is calculated in the usual, obvious way. 


32.1.3. Exterior Derivative 


Lemma. [f T=T, ,dz*A...Adz® ts a k-form, then 


o 
dT = Fa dae cls: 
Ox 


is a (k+1)-form (exterior derivative). 


Remark 1. Recall 7,..,=7'q,...(z). The lemma is a generalization of Theorem 29.3.2(a). 
We make the convention that d7 =/,,da° if 7’ =j(«) is a 0-form (scalar). 


32.1.5. 32.1. Tensors and Differential Forms a 
eid itemise ee ee 


Theorem. If T is a k-form, and if 8 is an l-form, then 
d(7+8)=dT+d8, d27=d(d7T)=0, (1) 
dS =a As (1k T Ads 

with k=l in the first equality ‘ 


Remark 2, It is of special importance that a27=0. This property follows from the anti- 
symmetry of the exterior product and from the commutativity of partial derivatives. 


32.1.4. n-Forms 


We consider an n-dimensional metric space as defined by Def. 29.5.6/1. The 
associated fundamental tensor shall now be written in the form g=gud2' @dx'. 


Further let |g] =/det g,|. According to Theorem 29.3.1/2 one has Vig’ (a’)| = 

CU!) aa NOR 0 
aay Vga), where the symbols have the meaning indicated previously. 
According to (32.1.2/2), an n-form T in its reduced representation has the form 
ae ely. va 


Theorem. » =V|g(x)| dz!A...Adx” ds an n-form. 
Remark. What is meant is that, for co-ordinate transformations a’* =2'*(z!), the following 
relation holds: 

Vig(z)| data ... Ada” =V\g(x’)| dx’tA ... Ada’™. 
This is easily verified on the basis of (32.1.2/3) and of the above transformation law for 
Vig(z)|. 


e 


32.1.5. Theorem of Poincaré 


If A isa (k—1)-form in an n-dimensional orientable C*-manifold, then d7’ =—0 
for the k-form T=dA. We will state the converse to this proposition. 


Theorem (Poincaré). If T 1s a k-form with dT =0, then locally (7.e., in a suitable 


neighbourhood of every point of the manifold) there exists a (k—1)-form A such that 
= dA 


Remark 1. For a given 7, dA=T is a system of linear partial differential equations of 
first order for the components of A. The theorem gives a positive answer to the question of 
whether the necessary integrability condition d7’=0 is also sufficient (at least locally). 
If in a local chart one has (in the reduced representation) 

ipl > Pde... Naa’, 


!sa<...<b=n 


A=(h—1)! os A deh a. dee, 
WEY 8 5 NO 
then, locally, 
i 


Aea=j Bf Pre nalta) lat 
0 
is a solution of 7 =dA (cf. [40], p. 218). 
Remark 2. If Ay is a special solution of dA = 7, then the general solution is given by A = Ag+ 
+dB, where B stands for an arbitrary (k —2)-form. 
26 Triebel, Math. Physics 
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32.2. Integral Calculus on Manifolds 
32.2.1. Integrals of n-Forms 


We shall now extend the investigations made in Subsec. 29.3.3. From now on 
we shall always assume that M be an n-dimensional metric space (in the sense of 
Def. 29.5.6/1) with the fundamental tensor g=g,dx* @dx'. Given an n-form T in 
the reduced representation, 

SI siete aol Naan (1) 


with a compact support supp 7’, where, for the time being, we assume that 
supp 7’ lies in a suitable local chart U (Fig. 32.1). If 7;,,(x) is integrable in the 
Lebesgue sense, then we set 


i IRE Googe: 


0 J i 
Cee 


It is easily seen that this definition is independent of the choice of the local co- 


(2° 
ordinates. This follows from 7j..n(7’)=T1..n(2) a 
admissible co-ordinate transformation. 


'k 


, where xr =x" (r') is an 


Definition. (a) If T ws an integrable n-form on M with a compact support, and if 
{pehe 4a [Pek of unity tn the sense of Lemma 29.1.3, then 


pie Pa J Po: - 
(hyry f 7s an SO MTUE function on M uth a compact support, and 7f jw is the 
n-form of Theorem 32.1.4, then 
ffu=fT where T=f(x)p. 


Remark 1. 7 is said to be locally integrable if 7) _,(a) p(x) is Lebesgue-integrable for each 
of the above functions y, in the associated local chart. Since Z’ has a compact support, 


°° 


there is only a finite number of non-zero summandsin 2 f T@p,. (a) is meaningful and 
[G—| 
does not depend on the particular choice of the partition of unity. 


Remark 2. Uf /(z) is an arbitrary locally integrable function on J/, and if x is the charac- 
teristic function of a compact set D in M, then we write ffu=f fx. 
D 


32.2.2. The de Rham Operator 


oe 
diheorem: 275 =—¢ Ak YS a contravariant vector, then 


*E = (— 1)! g(a] E(x) da! A... Ada?! Adz! A...Adx” (1) 
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is an (n—1)-form, and d(*5) = (div §)u, rhere yw is the n-form referred to in Theo- 
rem 32.1.4. 


Remark, This means that in changing the co-ordinates from 2 into a’! in (1) one only 


needs to replace a by x’ (as well as g(x) by g’(x’) and £"(x) by §’"(x’)). According to Subsce. 
BB Zee 


dig == 


it a 
a ant St lal") (2) 


is a scalar. The last assertion of the theorem then follows from Lemma 32.1.3. 


32.2.3. The Stokes Theorem 


As usual, let M be an n-dimensional metric space (in the sense of Def. 29.5.6/1). 


Definition. Jf 1=m<n, then the subset D of M is called an m-dimensional (smooth) 
submanifold of M if, by choosing a suitable atlas {V,; Fy}, it ts always possible to 
achieve that 


FADO we) = Hae | IE Gay qe! == boo =y"” =(} 
in every chart U,=F,(V,) (Fig. 32.2). 


Fig. 32.2 


Remark 1. {V,; F;} has the meaning set out in Subsec. 29.1.2. Tt is easy to see that D is 
an m-dimensional C*-manifold in the sense of Def. 29.1.2. 


Remark 2. In an arbitrary local chart, the image of D (as far as it is covered by this local 
chart) can be represented as Ge =P), PSI cng Wy DUVL PSAly coon HO (parametric represen- 
tation). 


Remark 3. Of special interest are (n —1)-dimensional submanifolds (hypersurfaces), which 

can be represented as S(%) =0 or as S(x) =c in local charts, where S(x) is a C°-function with 

grad S(x) +0. 

Lemma. If D is an m-dimensional submanifold of M with the parametric represen- 
. . Ay ap 7 ' 2, 

tation x* =x" (2") in local charts, and if T= Paya \...Adx* is a k-form on M, 

then 


rn nop 
Cx by Ox by 
in i4pl aoe i icone iW 1 
1 R= ean iS da Ja Neer ¢ ( ) 
CC ats 1 bp 
ay ay ea) ao da 4A...Ada 
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is a k-form on D. Further, 
(T+8)p=T'pt+S8p, (LAS) — lpi ne (d7);=d(7 5) @) 
where S is an l-form (where k must be equal tol in the first formula of (2)). 


Remark 4. The summation in (1) goes from 1 to ” over a, and from 1 to m over 6,. 7p is 
the restriction of 7’ to D, the induced form. 


Theorem. Let Q be an open set in M, such that Q (the closure of 2) is compact. 
Further let the boundary @Q=Q\Q be an (n—1)-dimensional hypersurface. If 


ee. 
Eu f ae OS @ contravariant vector, then 


Ox 

f(*8)ao = f (div e= fare). (3) 
dQ Q 2 

Remark 5. 02 is an (n —1)-dimensional compact orientable C-manifold. Theorem 32.2.2 and 

the above lemma then show that the first integral in (3) can be formed by Def. 32.2.1. Fur- 

ther Theorem 32.2.2 shows that the last two integrals in (3) are meaningful and equal. 


Remark 6. With the use of (32.2.2/2) and of local charts it is possible to reduce the theorem 
to Theorem 9.3.1/2. Here the direction of the normal y in Theorem 9.3.1/2 must be chosen 
suitably (which corresponds to fixing an orientation on 0Q; cf. [17], p. 26)!) 


32.2.4. Leray Forms 


Given an open set 2 in the n-dimensional metric space M (Fig. 32.3). Further 
let S(p) be a C»-function on 2 with grad S +0 in 2. Then 
S,= {p | pe Q, S(p)=t} 
is a family of (7 — 1)-dimensional C~-hypersurfaces (provided that S, is not void). 


Theorem 1. If pu is the n-form referred to in Theorem 32.1.4, and if dS=S8,d2", 
then in Q there exists an (n—1)-form wg (Leray form) such that u=dS \pys. Here 
the restriction (ug)s, of Hs to S, ws unique. 


Remark 1. The geometric meaning of wg becomes clear if one compares p=dS Aug with 
the conventional infinitesimal classical analogue dz=dao- dy. If, for instance, |g(x)| =1, 
then s« corresponds to the n-dimensional volume element dz, dS corresponds to the vector 
jgrad S|dy (vy being the normal vector), and (tus)8,| grad S| corresponds to the (7% — 1)-dimen- 


sional surface clement do on the surface S; (Fig. 32.4). 


a6 
5 
if 
aus 
Fig. 32.3 Fig. 32.4 


Theorem 2. [f y zs an integrable function with a compact support in 2, then 
fpe=fal f vus - 


!) A recent but nevertheless clementary description of integration theory on manifolds 
(including Stokes’ theorem) may be found in [60]. 
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Further, 


d 
aE [ve= [ves for almost allt. 
i é 


S<t Oe 


Remark 2. The theorem becomes clear if it is compared with the classical analogue in the 
sense of Remark 1. S<é means the set {p | »€Q, S(p)<t}. In the sense of Lemma 32.2.8, 
it would be more precise to write (4/s)s, Instead of srg in the above theorem. Since seg is 
an (n—1)-form, all integrals are defined. f dt is an integral over R,. Finally, in the deduc- 
tion of the theorem it is seen that the oricntation on S, must be suitably chosen (otherwise 
one has to replace 15 by —y15). 

Remark 38. Let S(p) and 7(p) be any two C~-functions on 2, with dS+0 and d7=+0. 
Again let S;={p | S(p)=t} and 7,={p | T(p) =u} (Fig. 32.5). One has 


AS AAT =S,,7 ,dx* Ada! = > (Sil ys de" Ada . 


lsk<l=n 
ve 
|p 
74 
Fig. 32.5 


Hence it follows that grad S is parallel to grad 7 if and only if dS Ad7'=0. If one requires 
that dSAd7'+0, then the surfaces S, and 7, intersect at a positive angle of intcrsection 
(provided that t and u are chosen suitably). Let 3,=S:9 7. By iteration it follows from 
Theorem 1 and Theorem 2 that there exists an (n —2)-Leray form wg_7 such that ~=dSA 
aT Aus,r- One has 19,7 = —¢7,s- Further, with the assumptions of Theorem 2 one has, for 
almost all ¢ and u, 


fara Bk | 
Me ears ot 


ay S<t 
tu Tu 


ones Distributions on Manifolds 


32.3.1. Sealar Distributions 


In Chap. 22 we considered distributions defined on domains @ of R,. Now 

we are going to extend Def. 22.1.2/2 from D’(Q) to C~-manifolds M. Here we are 
faced with difficulties which can be overcome in different ways. We shall follow 
the lines of the presentation given in [17]. We shall always assume that M isa 
metric space in the sense of Def. 29.5.6/1, with the fundamental tensor g = gj:da* ® 
@dzx'. Asin Subsec. 32.1.4., let |g] =|det gl. Finally, we denote by DM) (by anal- 
ogy with Subsec. 22.1.2.) the complex-valued variant of C7(M) as referred to in 
Subsee. 29.1.3., that means the class of all complex-valued functions with com- 
pact support that are differentiable on M up to arbitrary order. 
Definition. A complex-valued linear form T on D(M) is called distribution on M 
if for every local chart U there exists a distribution Ty€D'(U) such that T(p) = 
=T,(Vig(@)l ¢(x)) for all p€ DU). The space of the distributions on M ts denoted 
by DEM). 
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Remark 1. Here 7 is called a linear form if T(A,1 +A gq) =4,T'(1) + Ag 7 (2) for arbitrary 
complex numbers 4; and Ay and for g,€D(M) and g.¢€D(M). The local chart U is an open 
set in &,. One has U=/(V), where V is an open set in M and F is a homeomorphism (cf. 
Subsec. 29.1.2.; Fig. 32.6). So, in the above definition, it would be more precise to write 
gy €D(M) with supp g@cV and (F—)(x) instead of p(x). But there is no fear of confusion. 
Of course .D’(U) has the previous meaning of Subsec. 22.1.2. However, since V|g(x)| }| p(x) € 


EDU), Ty(V|g(x)| pla) is meaningful. 


(x) 


Remark 2. Distributions are added and multiplied by complex-valued C-functions in a 
way analogous to that set out in Def. 22.1.4/1(b, c). D’(.W) then becomes a linear space. The 
differentiation in the sense of Def. 22.1.4/1(a), however, is somewhat more complicated. 
leading to the definition of tensor distributions. 


Regular distributions: By L'°°(M1) we denote the class of all complex-valued lo- 
cally integrable functions on M, in the sense of Remark 32.2.1/1 (complex 
variant). If fe L1°°(M) and »€ D(M), then 


Tp)=f fre (1) 
is meaningful according to Def. 32.2.1. One has T'¢€D’(M). For, if y(xr)€ D(U) in 
the sense of the above notation, then by Subsec. 32.2.1. one has 


T(r) =f Vig@)l Kx) plz) de, 


where f(x) is again the function transferred to U. So T', €D’(U) equals Vig(x)I f(x) 
This also gives us a motive for the above definition. Moreover it is seen that the 
correspondence between /¢L/°°(M) and the associated distributions in the 
sense of (1) is one to one, provided that one identifies two functions f(x) € € Liee( Mf) 
and g(x)€LY"(M) whose images f(x) and g(x) in local charts coincide a.e. (in the 
Lebesgue sense). In this sense (by ae with Subsec. 22.1.3.) we write L'°°(M) c 
<D(M), and fe LPM) is called a regular distribution. 


5-distribution: If p¢M and pe DM), then we set 6,(~)=¢(p). This is the anal- 
ogue to the 6-distribution of Subsec. 22.1.3. One es O67 (iM). li ieee, local 
pit with /pe U, then, for pe D(U), 
(F-'¢)(p) =d(F'p) = (6p) v (Vig@)I g(a), and hence 
(Ep 
brie) 
Vig(Fp)| 
where for this time we have distinguished, for the sake of clearness, between 
p(x) €D(U) and its image F-'!pe D(M). 
Compatibility (Fig. 32.7). If VV’ +0 and g€D(VN V’), then let p(x) and ¢’(x’) 
be the functions transferred to the local charts U and U’. If T¢D’ (M), then 


Ly Vigle)l oe) = ToAV ge) 9 (@’)) « (2) 
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(x) (x!) Fig. 32.7 


Theorem. (a) Let {J,.: Fy} be an atlas of M with the local charts U, = F,(V,). Lf the 
distributions Ty,€D'(U,) satisfy the compatibility condition (2) for arbitrary 
pairs (U,, U;) instead of (U, U’) then there exists a unique distribution T€ DM) 
in the sense of the above definition (with U;, instead of U). 

(b) A complex-valued linear form T on D(M) is a distribution if and only if for 
every compact set Q in M there exist a positive number C and a natural number N 
such that, for all p€ D(M) with supp gc Q, 

ITig)leC sup S leye).12,(P)l - (3) 
pée® |ejaN 
Remark 3. Part (a) is a localization theorem in the sense of Theorem 22.1.4. So it will 
suffice to consider distributions locally, by analogy with the local geometric object consid- 
ered in Subsec. 29.2.1. If {g,} is a partition of unity associated to {V;}, as referred to in 
Lemma 29.1.3. then it is possible to determine 7’ by 


Tp)= Z To V gle) Gul) G(x), EDA). 


Remark 4. (3) is best interpreted in local co-ordinates. It is the analogue to (22.1.5/1). As 


n 
in that case. «—(x,, .... %,) is a multiple index with |x| = oa,;. Further, @- is a ten- 
il nt } | 1 Ry ooByy 


3 


sor of type (0, | ), where g is subjected to a total of |a covariant differentiations with respect 
tO 44. -..; &,- The formula shows that distributions can also be differentiated, but the factor 


} g(a.) in the above definition suggests covariant differentiation. 


32.3.2. Tensor Distributions 


Definition. (a) D')(M) is the class of all tensors @ of type (7; s) with compact sup- 

port and complex-valued components that are differentiable up to arbitrary order. 
(b) A complex-valued linear form T on D&"(M) is called a tensor distribution of 

type (r, s) if for every local chart U there exists a system of distributions T ak uae 


€D’(U) such that 


Peat pe 
Te)=T ys, .2, Vig@)l oO %a)..0,42)) (1) 
a C 7] x , 
for all g=g¢''™ peak 7-8. Og Od... dx" EDE"M) with compact 
” Ox On * 


support in U. The space of these tensor distributions 1s denoted by D'*)(M). 


Remark 1. In part (a) we now admit (in contrast to previous definitions) that the compo- 
nents of the tensor 


7) 0 
es Dh poo 
Vay eee ia oo 


@ da’! Q... @dx’ 
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are complex-valued. By analogy with the notation in Subsec. Peosell on ene) in 


(1) is the corresponding component in the local chart U. The requirement that these 
components be differentiable up to arbitrary order is meaningful, because M isa C*-manifold. 
The support of y is again determined in the local charts as the closure of the set of all 


points x for which at least one of the components Ce ee) of @ is different from zero. 


Remark 2. Part (b) of the definition generalizes Def. 32.3.1. Thus, in particular, D/(-( Af) = 
DM). Addition of tensor distributions and multiplication by complex numbers are defined 
in an obvious way. This makes D’("5)(J/) a linear space. 


Regular tensor distributions: If the tensor 7 of type (r,s) is locally integrable 
(i.e., if the components of 7’ are locally integrable), then 


4 nod hs ee) & 
T(¢) =f ive . by... 6? : "a jena? (2) 


7) oe : 
one ar @da" €D”"(M), is a tensor distribution. If one again 
Cee 
identifies locally integrable tensors whose components are equal a.e., then one 
obtains a one-to-one correspondence between these locally integrable tensors and 
the associated distributions in the sense of (2). 


with ¢= oe 


32.3.3. Covariant Derivative and Coderivative of Distributions 


Definition 1 (coderivative). For p= {gp ""s, | JED), 


@ 1.08 Aj a4-..8 
(69) eed = = Uioean 
are the components of 69. 

Remark 1. It is immediately seen that dp€D”)(M). 


Lemma. /f P= ee OC and if pap l’s, 2 JE DOTY M), 
then 


B,D» Ayhy..8 by ..b My. 
f iP t “Oy og stig? ie *b bp lt =f ie : ay...a,(P) ’ 8b ..Dyl o hy 
Remark 2. This formula is the starting point for the definition of derivatives of tensor 
distributions (especially also of scalar distributions). 
Definition 2. (a) (covariant derivative). Let T={1 ry a }€D'(M). Then 
W = ee Say where 


(VP)(~)=T6p) for geDertM). 
(b) (codertvative). If T= {Tsp JED'C+LOM), then dT = {(6T\! ee 
where (6T)\(p)=T(V ~) for pe D@)(M). 


Remark 3. Here Vp={p"!""",,__9,:p9} for g={p 4, _.o,}- The definitions are meaningful 
and agree with (1) for smooth distributions. In case (a) one has V PED 8+ D(A), and in 
case (b) one has 07’ €.D’@.r)( AL). Here s and 7 are integers, r=0 and s=0. 


div and grad: If 7¢€.D’C%)(H/), then let div 7’ = — 67’; so in particular one has div TOD NM Ve 
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Tf 7¢D(M), then let grad T= V7; so in particular onc has grad 7 €.D’,')( AL). Hence 


grad T(y)=—T(divg) for TED (MM), pEDO.YAD , (2) 
div T(g)=—T(gradg) for TEDUM%M), peD(M). (3) 
In particular, grad g ={g,,} for p€D(M), and 
il DS 
li — Le Og ears ee fe 
OS neers (Vig(z)) @(e)) (4) 


for p¢ DU. (M). The validity of the last formula follows from Theorem 29.3.2(b), with the 
use of the normal co-ordinates of Lemma 29.5.6/2. 


32.3.4. The Wave Operator 


In this subsection, let M be an n-space-time, that is an n-dimensional metric 
space with the signature xn —2 (Lorentz metric); cf. Subsec. 29.5.1. For n=4 one 
has the usual space-time. 


Definition (eave operator). If Te D’(M), then 
OT =div (gM(grad T)y) =div (99 Ty). (1) 
Remark 1. The definition is meaningful. One has g*(.7 7),;€D’C.°( 1), where multipli- 


cations of tensor distributions by C~-functions are defined in the usual way. In particular, 
OT ¢€D’( 3M) then is again a scalar distribution. 


Remark 2. If 7 is a smooth function, for instance 7'¢€D(M), then it follows from (32.3.3/4) 
that Bs 
il 0 ( —— oT 
T =—— —, (Vig(x)| g(x) = )) : (2) 
Vigte)| a" ue 
In normal co-ordinates (Lemma 29.5.6/2) one then has 


(= Il eT e2T 
= = : 3 


r={ 


But this is the usual wave operator. Hence (2) is the invariant version of (3), and (1) is the 
extension of this version to distributions. 


Remark 3. From (32.3.3/2) and (32.3.3/8) it follows that 
O%(@)=7T( Op) for Tev(M) and peD(M). (4) 
Hence CO is formally self-adjoint. 


General wave operator. Let a= {a'} be a vector with C*-components, and let b 
be a C>-function, and 7'¢€D’(M). Then PT'€D’(M), where 


PT=OT4+a{(VT)+67 . (5) 
The analogue to (4) now reads, for 7'¢ D’(M) and pe DM), 
(PT)(¢)=T(P*p) with P*p= Og —div (a9) +bq. (6) 


32.3.5. Distributions of Type f(S) 


: . Qo ¢ 5 Yoo a 
Consider the same situation as in Subsec. 32.2.4. The open set 2, the C func 
tion S(p), the surfaces S;, and the Leray form py shall have the meaning indicat- 


ed there. 


382 33. Wave Equation on Curved Space-Times 33.1.1, 


Theorem. [f f¢ DR), and tf 


MS\9)=H fens) for peD(Q), (1) 
t 


then f(S)€D’(Q), and Vf(S)=f(S) V8. 
Remark 1. (1) is meaningful, because S pHgs€D(R,). f CD #,) is the distribution derivative 


St 
of f. Then (8) has to be determined as shown for (1). 


Remark 2. Tf Bezby ea then it follows from Theorem 32.1.4/2 that 
S) (p) =f H(t)dt {eres=SHS(p)) PP) t- (2) 
Sy 
This justifies the set-up (1). 


Remark 38. /(S) depends continuously on f: {;>/ in D’( Rj) (thatis, fi(p) > f(y) for ally € D( R,)) 
implies that /;(S)~/(S) in D’(Q) (that is, /;(8)(p~) ~/(S)(p) for all gE D(Q)). 

Remark 4. If S and 7 as well as ps p have the meaning indicated in Remark 32.2.4/3, and 
if f¢€.D’( Ry), then it follows that /(S, 7’) €.D’(Q), where 


KS, P\P)=K_ f ers,r) for peD(Q). 


“tw 


33. The Wave Equation on Curved Space-Times 


33.1. Characteristic Surfaces and Singularities 
33.1.1. Characteristic Surfaces 


In this chapter, an n-dimensional Lorentz-metric space, in the sense of Re- 
mark 29.5.6/2, will be called an »-space-time. If n =4, we speak of a space-time, 
as in the chapters on General Relativity. In Sec. 33.1. we shall consider n-space- 
times, whereas later on we shall restrict ourselves to the physically interesting 
case n=4, 1.e., to space-times, 

Definition. Let M be an n-space-time. 

(a) An m-dimensional submanifold of M is called space-like if every curve es) 
ole oles? 
a6 dy © 

(b) An (n—1)-dimensional surface S(x)=0 is called characteristic surface tf 
dS +0 and g(x) S84=0 on S(x) =0. 


that lies in rt 1s space-like, that is tf g*(x(s)) 


Remark 1. Except for the vertices of characteristic conoids (cf. Remark 2), we shall always 
assume that all the surfaces considered are smooth (differentiable up to arbitrary order). 
Then the definition is meaningful. Spacc-like surfaces have already been described in 
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Subsec, 31.1.3.; see also Def. 32.2.3, where the concept of a snbmanifold has been defined. 
In part (a) above, 1=m=n—1. Part (b) eorresponds with Def. 29.5.3. dS +0 means that 
at a point « where S(«)=0 not all the derivatives S,;(a) vanish simultaneously. S(z) is 
always assumed to be an (at least locally defined) C~-function. 


»L- Pe stetil ; 7 ny ye 7 ] 
Remark 2. Characteristic conoids and rays shall have the same meaning as in Subsec. 
29.5.3. dS +0 is not satisfied at the origin of such a conoid (a in Subsec. 29.5.3.). 


Remark 3. If S(v)=0 is an (x —1)-dimensional surface such that dS+0, and if a*(s) is a 
k 

curve within this surface, i.e., if S(a*(s)) =0, then it follows that S., ml 

{S,,;} is orthogonal to S(x)=0 (in the Lorentz metric). — 


=0. Hence grad S= 


Theorem. (a) /f S(v)=0 is a characteristic surface, then the rays (null g eodesics) 
kt 

lying 7n this surface ure identical with the solution curves x*(s) of SE gS a(x"(8)). 
8 


(b) If Pf= Of +a'(vf)y+ Of zs the general wave operator of (32.3.4/5) for arbitrary 
C~-functions f, then an (rn —1)-dimensional hypersurface (submanifold) S(.x) =0 is 
a characteristic surface if and only tf the restriction of Pf to S(x)=0 can be expressed 
by the tangential derivatives of the restrictions of f(x) and (Vf) (x) to S(x)=0. 


Remark 4. By Theorem 29.5.4, rays and null geodesics are identical. Now it is relatively 
easy to deduce part (a) from 


dx* da! _ : 
Gil = = = Yd “SryGQ Sy = G9 Si ySiy =O - 


ds ds 
Further it is seen that the contravariant version g*S,, of the orthogonal vector grad S 
(cf. Remark 3) is a tangential vector. 


Remark 5. To explain part (b), we may assume that S(x) =a! It then follows from (32.3.4/2) 
72 
ia Ppsg™ ae .... Where the remainder + ... includes terms of the form f(x), /,;(2) 


: (az")? 

and fix-(x), where k=1,...,m and r=2,...,. Hence this remainder can be expressed by 
: ae 0 

the tangential derivatives aad’? Bah of fand vf. So the requirement that P/ be express- 

ible in this way is identical with g11(x)=0 for S(z)=0. But since S(x)=21, the condition 

that g!(x)=0 for S(z)=0 is equivalent with S(x)=0 being a characteristic surface. This 

proves (b). 


Remark 6. Part (b) shows that the Cauchy problem for Pf in its usual formnlation makes no 
sense for characteristic surfaces. f(z) and V/(x) cannot be given arbitrarily on a charac- 
teristic surface S(z)=0 when solutions of Pf(x)=0 are desired in a neighbourhood of this 
surface. For it follows from Pf=0 and from part (b) that there is a dependence between 
f(z) and Vf(z) on S(x)=0. The typical initial value problem for the wave equation with 
data given on characteristic surfaces has only f(z) given on these surfaces. Later on we shall 
preferably deal with the Cauchy problem with data given on space-like hypersurfaces. In 
this case the problem then has the usual formulation. 


33.1.2. Initial Value Problems for Characteristic Surfaces and Null Fields 
Theorem 1. If o ts an (n—2)-dimensional space-like submanifold, then locally 


(i.e., ina suitable neighbourhood of an arbitrary point of a) there exist exactly two 
different characteristic surfaces which contain o (Fig. 33.1). 
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as Fig. 33.1 Fig. 33.2 


Remark 1. If o is a closed compact (mn —2)-dimensional space-like submanifold, then the 
theorem holds globally, i.e., for a suitable neighbourhood of the total surface o. Let Sux) =O 
for k=+1, gyz(z)=1 for k=1,...,2—1, and gyp(x)= —1 (n-dimensional variant of the Min- 
kowskian space), and let ¢ be the surface of an (n—1)-dimensional ball in the plane 
z"=(). Then the two characteristic surfaces desired are the two circular cones that include 


an angle a with «”=0 and pass through o (Fig. 33.2). The general case is the locally 


diffeomorphic distorted variant of this figure (Fig. 33.3). 


6 Fig. 33.3 0 Fig. 33.4 


Nonlinear differential eqnations: Theorem 1 and Remark 1 are the local and the global 
version, respectively, of the solution of an initial value problem for a nonlinear partial 
differential equation of first order. Desired is a function S(x) with dS +0 which is differen- 
tiable up to arbitrary order in a (local or global) neighbourhood of a, satisfies the equation 


iy ae oe f int ith S(x)=0 1 

g(x) ae ele or points « with S(x%)=0, (1) 
and whose associated surface S(a) =0 includes the submanifold ¢ (locally or globally). Other 
than in the linear theory, one obtains two solutions instead of a unique one (o is space-like 


as in the above theorem). 


Huygens’ construction: As in geometrical optics, the two characteristic surfaces 
of Theorem 1 can be constructed as wave fronts. For every point x€o one con- 
structs the characteristic conoid A, referred to in Theorem 29.5.3/1 (Big. 33.4), 
The two characteristic surfaces of Theorem 1 are then (locally or globally) the 
enveloping surfaces of these conoids, that means 6(U K>). 

ve 


Null fields: The above problem (1) can be extended as follows. In an open set 
2 of M, a C»-function S(x) with dS +0 is desired, such that 


as as 


Al 
PN) Ge Ge 


=0 for x€Q (2) 


(null fields). Here initial data S(x) =S)(x) for x¢€ A are given on an (nx —1)-dimen- 
sional hypersurface A in 2. So(p) is a C?-function on A, and in addition it is 
required that So(p)=¢=const yields an (n —2)-dimensional (smooth) submani- 
fold of M for every admissible c (Fig. 33.5). 
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Fig, 33.5 


Theorem 2. Let {p | So(p)=const} be (n—2)-dimensional submanifolds of M. 

(a) If A is space-like, then in a suitable neighbourhood U of an arbitrary point 
yc.l there exist exactly two null fields S’(x) and S’’(x) such that S’(x)=S" (x) = 
Dau) jones” 1 A. 

(b) Lf A rs a characteristic surface, then in a suitable neighbourhood U of an 
arbitrary point yCA there exists exactly one null field S(x) such that S(x) =So(x) 
fomPne UO. 


Remark 2. The null fields can again be obtained by Huygens’ construction. | 


33.1.3. Caustic 


The characteristic surfaces and null fields of Theorems 1 and 2 of Subsec. 
33.1.2. can be constructed as follows. (The constructions are started in the 
proofs of these thcorems, and subsequently it is shown that the surfaces obtained 
in this way have the required properties.) Let o again be an (x — 2)-dimensional 
space-like submanifold (Fig. 33.6). If yéo, then let 

gly) = 0 
for all tangential vectors ¢= {¢'} to o at y. The set of all these orthogonal vectors 
& — {5} span a 2-dimensional plane (in the space of the vectors). In this plane 
there are two null vectors, &4)(y) = {€*c1)(y)} and €)(y) = {5*a)(y)}. Now one deter- 
mines the two geodesics that pass through y and have the directions of these null 
vectors, 

d2x* pay da! dak ale 

ae OK = Os 1 

aa ttimb as ap=o “Oa GO) (1) 
where &*=&',(y) or 6 =&)(y). By varying y€o one then obtains two surfaces 
S, and S, which are spanned by the set of all these null geodesics. These arc the 
desired characteristic surfaces of Theorem 33.1.2/1. If one of these two surfaces, 
say, S=S,, is fixed, and if o is parametrized by x* = x*(A), ..., ie a oe 
then 

(ee)! eA me WICK tiem Lp a5 (2) 
is a parametrization of S. Here s is the arc length of the associated null geodesic 
in the sense of (1). At least locally this is a representation of the characteristic 

Oak) (dx 
surface. In a neighbourhood of a point y¢o, the »—1 vectors eae tanh sae 
k 

|=} are linearly independent. If the null geodesics (1) are elongated, then 


singularities may occur. One example of, this is the vertex of a conoid in the 
Minkowskian space (Fig. 33.7), where o is the surface of a 3-dimensional ball in 


the plane x*=0. 
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caustic 


Sn 
7 ‘ 
Sa Fig. 33. 6 Minkowskian poe Fig. 33.7 


Definition. The caustre (of the Duele surface 8) is the set of all points p for 


Ge) Wie 7) 
which (in local co-ordinates) the vector 35 : 1, I, ce — } are linearly depend- 
eat as on or 


Remark 1. The above example of a cone in the Minkowskian space is not typical. Generally 
the caustic lying on S consists of several (x —2)-dimensional manifolds. The number of these 
(n —2)-dimensional manifolds generally amounts to n —2 (Fig. 33.8). 


wh 
Fig. 33.8 


Remark 2. Tf one deals with null fields in the sense of Theorem 33.1.2/2, then the caustic 
can be constructed on every characteristic surface S’(~) =c. If ¢ is varying then one obtains 
several (n—1)-dimensional surfaces in the general case. 


Theorem. A point p=<x"*(s, Al, ..., A”~*) of the characteristic surface S belongs to the 
caustic if and only if 

ain een: os 

«,B=1 


Remark 8. For the calculation of A(s, A) one has to use local charts. However, if A(s, 4) =0 
holds in one local chart, then this is also true in all other admissible local charts. 


33.1.4. The Caustic in the Minkowskian Space 


Jf M is the (4-dimensional) Minkowskian space of the special relativity 
theory (see, for instance, (24.2.2/3)), then the rays coincide with the straight 


lines which include an angle of a with the x‘-axis. It is our aim to determine the 


caustic of a 2-dimensional surface which lies in the plane x+=0. For that pur- 
pose, some preliminary considerations are of advantage. 


Evolute: The analogue to a caustic for a plane curve in the 2-dimensional 
Euclidean space is the evolute of this curve; cf. Subsec. 17.1.3. 


Principal curvatures. Consider 2-dimensional smooth surfaces in the Euclidean 
3-dimensional space. If x= (x!, x2, x) is a point of such a surface, then let V be the 
associated normal vector. A normal section is a plane through x which contains N 
(Fig. 33.9). In this way a plane curve is cut out. Let x be its curvature in the 
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sense of Subsec. 17.1.3. Now suppose that x be signed, as indicated in Migs .lC: 
If gy has the meaning indicated, then x =x(¢) depends on @. Points with x(v)= 
= const are called umbilical points. If x is not an umbilical point, then there exist 
exactly oue maximum x,=x(¢,) and exactly one minimum X= x(q»), where 


ac 
2 =G,+5.- These two values are called principal curvatures. 


N N 


y XY x20 x40 Fi 
fo) 


Caustic. In the Minkowskian space, consider a 2-dimensional smooth surface 
which lies in the plane x+=0 (Fig. 33.11). Then a is space-like. Let o be free of 
umbilical points, and let it be parametrized by x*=x*(A!, 22), k= 1, 2, 3. Further 
let one of the two characteristie surfaces S be given by (33.1.3/2) with n=4, 
where s = 0 corresponds to the surface c. Let the prinerpal curvatures of o at the 
point (A!, 22) be x ,(41, 22) and x.(A!, 42). Finally, let VN = {N*(A!, 22)}8_, be the nor- 
mal vector, normalized and fixed in its direction, in Ry to ¢ at the point (4!, 22). 


Fig. 33.11 


Theorem. With the above assumptions, the caustic on S consists of two surfaces 
given by 
N¥(a!, 22) 1 


rye DN pl 2) 4 or . 4 2) 
ee ony a SS) oe 


qiene mys 


Remark 1. This is the analogue to the construction of the evolute in Subsec. 17.1.3. 


Remark 2. This construction can be extended to the n-dimensional Minkowskian space, 
nl 


ds?= > (dx*)2—(dz”)2. The caustic then consists of n —2 surfaces. 


p=] 


33.1.5. Discontinuities of Solutions of the Wave Equation; Catastrophes 


Let & be an open connected set in the n-space-time M. Further let Y be an 
(n — 1)-dimensional hypersurface which separates @ into the open sets 2, and Q, 
and Y (Fig. 33.12). Let u(x) be twice continuously differentiable on both Q, and 
5, and, for pc, let 


[u(p)}=lim u(g)—lim u(g) (jump at 2). 
q—p q-P 
gen GeQy 


[vu(p)]is formed analogously. Let Pu be the general wave operator of (32.3.4) oe 
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a 


Theorem 1. 7f u(x) fulfils the above assumptions, if Pu=0 in D’(Q), and if, more- 
over, |[u]l +I[ 7 u]l +0 on 2, then ¥ is a characteristic surface. 


Fig. 33.12 


Remark 1. Consequently, discontinuities of solutions of wave equations can only occur at 
characteristic surfaces. This cannot even be helped by resorting to the theory of distri- 
butions. 


Classical variant: The classical variant of the above theorem reads as follows. 
Let u(x) be a (classical) solution of Pu=0 which is twice continuously differen- 
tiable on Q. Further let w(x) be & times continuously differentiable on both Q; 
and Q., with k=3 and > [D*u]+0 on SY. Then ¥ is a characteristic surface. 
lalJ=k 

Here [D?u(p)] is defined analogously to [u(p)]. 
Theorem 2. Let the characteristic surface © in the domain Q be parametrized accord- 
tng to (33.1.3/2) by xt =a*(s, 4), with A=(Al, ..., 4*-2). If Pu=0 on Q, and if s’ 
and s are any two admissible parameters, then 

. a me 
[w(a(s”’, A))P | A(s’, a)l? =[ule(s’, a))P 1A(s’, I? e (1) 
Remark 2. a; is the covariant version of a!. Further, A(s, A) has the meaning indicated in 


Theorem 33.1.3. For a’=0, which corresponds to the self-adjoint case P=P* of Subsec. 
32.3.4., Hq. (1) assumes a particularly simple form. 


a aux! 8,2 
~F axis, Eas 
sr 5 2 


Remark 3. For a fixed A, c*=2*(s, 4) is a null geodesic. Consequently, discontinuities of 
solutions Pu=0 of the wave equation are transported along null geodesics. 


Catastrophes: Let [w(x(s’, 4))]=0. If s’” moves along the null geodesic x*(s, 4) 
towards a point of the caustic, then it follows that A(s’”, 4) 0 according to Theo- 
rem 33.1.3. From (1) one then finds that [u(2(s’’, 4))]| <>. Hence, if « describes a 
physical process which exhibits a discontinuity along a characteristic surface, 
then a catastrophe will occur upon approach to the caustic. 


nee Fundamental Solutions 


33.2.1. The Problem 


Fundamental solutions of partial differential equations were investigated in 
Sec. 23.1. within the theory of the distributions D’( Ry) (Def. 23.1.1). Such solu- 
tions proved useful, especially in the treatment of initial value problems. Among 


F : : oO : 
the three types considered in that section, A, A 7. and 0, we are now inter- 


ested in the wave operator O. In our present terminology, Theorem 23.1.4. reads 
as follows. In the (4-dimensional) Minkowskian space with the line element 
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ds? = (drt)? + (dr2)? + (das)? — (dat)? 
1 dx’ 
G(¢) = ~ Ae i ra i g(x, sane a; a) ds, > (1) 
iv vi 


where y€D(R,), is a solution of 

CG oe? eG Ca 

a ae a (2) 

(Oxr!)?  (@x?)2— (6a3)2 (Aart)? 
Here Kis the upper part of the light cone (characteristic cone) with origin 0, and 
Ay is the intersection with x4, that means, the 3-dimensional sphere {(x!, x2, 23) | 
(al)? + (2+ (x3)? = (x4)2}, with the surface element ds, (Fig. 33.13). One has 
supp G=K (ef. Remark 23.1.4/1), and G is singular. Our further aim is to extend 
this result to the general wave operator P of (32.3.4/5) in an arbitrary space- 
time. Hence, in particular,from now on we shall always assume that n=4, i-e., 
we shall consider space-times in the sense of the convention made at the begin- 
ning of Subsec. 33.1.1. The adjoint operator of (32.3.4/6) will be denoted by P*. 


Fig. 33.13 


Definition. Let Q be « connected open set in the space-time M. If q€ Q, then G,€ D’(Q) 
is called a fundamental solution of P tn Q 7f 


(PG@q)(¢) =Gq(P*p)=9(q) for all pEeD(Q). (3) 


Remark 1. The formulation is meant to imply that for every point ¢€@ it is possible to 
find a corresponding distribution G,. Referring to the 6-distribution to be considered in 
Subsec. 32.3.1.. (3) then also means that PG,—0, for all g€Q. In particular, distributions 
on manifolds shall always have the meaning stated in Sec. 32.3. Of course the above defi- 
nition can be immediately extended to n-space-times. 

Remark 2, The above definition is the analogue to Def. 23.1.1 for space-times and general 
wave operators. Other than previously, we now consider a family {@g},¢a of fundamental 
solutions rather than a single one. If JZ is the Minkowskian space, and if P= 1, then one 
ean obtain G, from (1) by G,(~) =G(p(q +:)). In the general case, however, such an operation, 
g(x) (¢+x) is not available. Other than in our previous investigations, now all of our 
considerations are local ones. 


33.2.2. Causal Domains 


By Theorem 29.5.5, for every point p of a given space-time M there exists a 
geodesically convex neighbourhood 2. Now it is possible to introduce a time 
orientation in 2 by subdividing the light cones (characteristic conoids) in 2 
continuously and consistently into future cones and past cones (cf. Subsec. 
31.1.1.). To this end one starts the subdivision at a point p,¢€ 2 and continues it 
continuously along the unique geodesic from p, to py (Fig. 33.14). Thus a time 
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Fig. 33.14 D°(q) Fig. 33.15 


orientation is always possible on a local scale, whereas globally it is an essential 
additional requirement. 


Definition 1. Let 2 be a geodesically conver domain of the space-time M, and let 
ge 2. Then (Fig. 33.15) 

D+ (q)={p | pe Q, the geodesic from q to p being time-like and future-directed}, 
O+(q)= {p | pe, the geodesic from q to p being null and future-directed}, 

J+ (q)=Dt (qg)UCt(q) (future emission of q). 

D~(q), C~(q) and J~(q) can be defined analogously. 


= 


Remark 1. If ¢€Q, then D+(q) is the interior of the future conoid with origin g, as far as it 
lies in 2, and C+(q) is this future conoid, as far as it lies in 2. One has C'*(q)=0D*(q) (in 
particular this implies that q¢Ct(q)). 


Definition 2. A connected open set 2 ts called causal domain if it can be imbedded in 
a geodesically convex domain Qy, 2 Qo, and tf, for any two points p,€ Q and py € 2, 
Jt (py)OJ7 (po) ts either a convpact subset of 2 or void. 


Remark 2. A causal domain for p¢€ 7 can be constructed as follows. Let p, and py be any 
points such that p€.D7~(po) and p€D*(p,). For a suitable choice of py and py, D7 (po) ND*(p4) 
is then a causal domain containing p (Fig. 33.16). 


Po 


Py Fig. 33.16 


33.2.3. The Distribution 6q+(L) 


Consider a causal domain 2 in a space-time M. If gé 2 and pe D*(q), then let s 
be the arc length (in the sense of Remark 29.5.4/4) of the unique time-like 
geodesic x(s) that connects p with g (Fig. 33.17). If g=a(s,) and p=2(s,), then 
let. I'(p, g)=|s;—50| be the geodesic distance of the two points p and gq. Let 
(p, q)=0 for peCt(q), and let F.={p|pe€Dt(q), [(p, q)=c} for any <=0. 


NOE r 


e y D*(q) 


0*(q) 


Fig. 33.17 Fig. 33.18 
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(Since q is fixed, we do not note the dependence on q.) In the Minkowskian space 
one has Hie | ele (x2)? + (23)? — (x4)? = 62} for g=0. In the general case I, is 
the diffeomorphic image of such hyperboloids (Fig. 33.18). 


Lemma. /f S(p)=/"(p, q) (where ¢ ts fixed, p variable), then gS 48, =48 in D*(q). 


Remark 1. Hence, in particular, grad S+0 in D+(q). Thus one has the situation described 
in Subsecs. 32.2.4. and 32.3.5. (the domain Q considered there is now D+(q)). 


Definition. [f 6.¢ D’(R) ts the 6-distribution with respect to the point e=0, 7.e., tf 
6.(y) = ple) for pe D(R)), then 6,,(1 —e) =5,(8) tn the sense of Theorem 32.3.5. 


Remark 2. In Theorem 32.3.5. one has to substitute D*(q) for 2. Hence one has 6,,(I"—«)€ 
€D’(D*(q)), and 


Og (DNS. I P(P)\ts for pEeD(D*(q)). (1) 


& 


It is clear that (1) can be extended to p€D(Q). Then 6,4 (1’—«) €D(Q). 
Theorem. 6,,(/°) =i 6,,(1—e) converges in D’(Q). 4 
& 


Remark 3. What is meant is that 6,4 (17) € DQ) and 64, (I’—«)(p) +094(L')(@) for p€ D(Q). 
Remark 4. If ./ is the Minkowskian space and if q=0, then 06,,(/’) coincides with @ of 
(33.2.1/1) except for a factor. Thus 6,,(/°) is the invariant (and local) version of G. 


Remark 5. One may substitute D-(q) for D+(q). Then one obtains a corresponding distribu- 
tion 6- (I). 


33.2.4. Fundamental Solutions 


Theorem. Jf 2 is a causal domain in the space-time M, then there exists, in the 
sense of Def. 33.2.1, exactly one fundamental solution G, of P in 9, such that 
supp @,cJ+(q) for q¢Q. Denoting this fundamental solution by Gey i caneve 
represented as Gj =U6,,(1)+V*. Here 4q,(1°) has the same meaning as in Theo- 
rem 33.2.3. Further, U=U(p, q) is a function differentiable up to arbitrary order 
on 2X 2, and Vt=V+(p, q) is a function defined on 2X Q, with supp V+ CAT = 
={(p,q) | p€ 2, qe, pES*(q)}, which is differentiable up to arbitrary order on At. 


Remark 1. It is possible to formulate a corresponding theorem fora fundamental solution G@, . 


Remark 2. One has supp 6,,(/')=C*(q). The support of the singular part Ud,.(1’) of Ce 
thus lies in C+(g), as in the case of the Minkowskian space, where Gq in essence coincides 
with @ of (33.2.1/1). In the genera] case this (Huyghenian) property is perturbed by the sum- 
mand Vr. 


& 
Remark 3. Let J/ bea space of constant curvature, Le., let Raped =o (GacJba —JocJaa) in the 


sense of Remark 30.2.1/3. Further let P= 0+6, ie., a’ =0 in (32.3.4/5). Then 
R 1 for péJ*(q), 
ee Hy ,(P), where Wy, (L)= 0 fermped (a): 


1 
pe see 
Gq = 487 


6q+(L) oF 


R=0 is the Minkowskian space, that is, Gy =G of (33.2.1/1) for g=0. 
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33.3. Solutions of Pu=f, Cauchy Problems 
33.3.1. Past-Compact Sets and Distributions 


We identify the causal domain Q of the space-time M by a local chart. This is 
not a restriction, since 2 is part of a geodesically convex domain, and such do- 
mains can always be represented by a single chart. G7 shall have the meaning 
stated in Theorem 33.2.4. 


Lemma 1. (a) Jf p€ D(Q), then G7 (p), as a function of q on Q, ts differentiable wp to 
arbitrary order, and supp Gj (p) CJ~ (supp ¢). 

(b) Lf K and K’ are any two compact subsets of 2, and if N =O, 1, 2, ..., then 
there exists a positive number c=c(K, Kh’, N) such that, for all pED(Q) with 
supp eck, 


> sup IDIGi(wlse > sup  |Dzo(p)| . (1) 


jaisaN Q€K (Ed aie rao al C40) 
Remark 1. If K is a set in 2, then J-(K)= U Ju(q) and Jt(A)= U J*(q) (Fig. 33.19). 
qek qe 


Further, D; has the usual meaning, the differentiation being with respect to qg=(q!, ..., g”), 
The statement made on the support of Ga (gy) in part (a) is an immediate consequence of the 
fact that supp Gy cJ*(q). This also explains the restriction p€AK 1 J+(K’) in (1). 


Definition. (a) A set A in Q ts called past-compact if J~(p)\ K is etther void or 
compact for all pe Q (Fig. 33.20). 


J*(g) 


p K 
Lp 0 
Ieee, 33) 11) Ties, B30) 


(b) D*(Q) as the class of all functions differentiable up to arbitrary order on Q 
which have a past-compact support. 
(ce) D’*(Q) is the class of all distributions in D’(Q) with a past-compact support. 


Remark 2. Every compact set is past-compact. It is, however, easy to find sets which are 
past-compact but not compact. D’*+(Q) is not the space dual to D+(Q). Functions in D+(Q) 
need not have a compact support. Consider, for instance, the causal domain Q = {x | |a4| <2} 
in the Minkowskian space, then K = {x | |v4|=1} is a past-compact set, and it is easy to state 
C@-functions which have the support K. 


Lemna 2.77 A vs compact and k’ 7s past-compact, then J~(K)Q K’ is compact. 
Remark 3. It can also be shown that J+(K’) is past-compact if K’ is past-compact. 
Theorem. /f {¢.D’*(Q), then f[G7]¢D’(Q), where f(G7\(~) =HGP(~)) for pe D(&). 


Remark 4. Tf p¢D(Q) with supp pC K, then, by Lemma i(a), supp Ga (g) ela (St ie is 
compact, then it follows from Lemma 2 that J~(K) Msupp f is compact too (Fig. 33.21). The 
theorem is then deduced from (1), with K’=J~(K) (supp f, and from the known rules of the 
theory of distributions. 
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Fig. 33.21 


33.3.2. An Existence and Uniqueness Theorem 


Consider the general wave operator P of (32.3. 4/5) on a causal domain 2 of the 
space-time Jf. If pe 2, then I'(p, g) has the meaning indicated in Subsec. 33.2.3., 
where now we extend I(p, q) to all points q with q+. If x*(s) is the unique ge- 


odesic that connects p with qg, where s is a geodesic parameter in the sense of Sub- 
A da* 
sec. 29.4.5., with p=-x'(0), then VI(p, q)=2 lel mas (Vv is apphed with re- 


‘spect to q). Thus in particular one has VI'(, q) +0 for ¢+ 7p, and it 1s possible to 
apply Theorem 32.2.4/1 with S(q) =I'(p, q) (pfixed). Hence u(q) =d of (pd) Aur(q), 
where we have indicated the dependence on q for the sake of learmess (Fig. 


33.22) 
& ae 


Fig. 33.22 


Theorem. If f€D’*(Q), then Pu=f has a unique solution in D’*(Q). This solution 
is u=f[G7], and one has supp ucJ* (supp f). Jf fe D*(Q), then u(p) belongs to 
D*(Q) and can be represented as 


1 
[00,9 fo) eats [V0.9 Ko) ua) (1 


u(p)= aE 
c~@ 


ade 
Remark J. U(p, g) and V+t(p, qg) have the same meaning as in Theorem 33.2.4. Further one 
has to understand f ... as lim f ... In the Minkowskian space with the usual co- 
C~(p) e}0 C~(p) 
; _ lsl=e  datAda2Adzs : 
ordinate representation. yy coincides with == armen apart from a factor. In this case 


it is easily seen that the above limit exists. On a local scale, the general case is the diffeomor- 
phic image of the situation found in the Minkowskian space. Hence it follows that the 
above integral exists in the general case, too. 


Remark 2. The fact that w=/[Gj] is a solution of Pu=f follows from 


(Puy(p) = ul P*p) = f(Gq(P*9)) =H) 
(cf. (32.3.4/6), Theorem 33.3.1 and Def. 33.2.1). The statement made on the support of « 
follows from the construction of /[ ral 


Remark 3. In the case of the Minkowskian space with the usual line element ds? =(daz!)? + 
da!Ada2A dx} 
xh 
suitable constant. Then w(p) of (1) is identical with the well-known retarded potentials (cf. 

19.3.5/1) and the first term of (23.2.2/2)). 


+ (da?)2+ (da?)2—(dat)?, one has V+=0, U=1i, and wr=c¢ , where c is a 
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33.3.3. The Cauchy Problem: Existence and Uniqueness 


The Cauchy problem (initial value problem) for the wave equation in the 
Minkowskian space (in our present terminology) was dealt with in Subsecs. 
19.3.5. (classical solution) and 23.2.2. (solution by distributions). It is our aim to 
extend the previous considerations to arbitrary space-times M (that means, to 
4-dimensional Lorentz-metric spaces) as well as to the wave operators P of 
(32.3.4/5). Here we confine ourselves to C-functions, i.e., to classical solutions. 


Cauchy data: Let 2 be a causal domain in a space-time M. We shall again as- 
sume that Q is identical with its local chart. Let S be a 3-dimensional (smooth) 
hypersurface in Q (Fig. 33.23). If u is a C?-function on S, and if ¢ is a direction 
: Ou. ; wo Clee . 
tangential to S, then — is known if w is. Further, if oe is given, where p is 
p 

not a tangential direction, then all first-order partial derivatives of won S are. 
known. We then say that Cauchy data are given on S, 


Theorem. Let 2 be a causal domain, and let S be a past-compact space-like 3-di- 
mensional hypersurface in Q, such that 0J*(S)=S (Fig. 33.24). Further let f be a 
C»-function in Q, and let C°-Cauchy data be given on S. Then the Cauchy problem 


Pu=f in Jt(S), u being a C°-function in J*(S) , 
ulgand Vulg assuming given values on S , 


has one and only one solution. 
J*(9) 


S=9I (5) 


g 
ieee 
Fig. 33.23 Fig. 33.24 


Remark t. The terms “space-like” and ‘‘past-compact” have been described previously 

(Def. 33.1.1 and Def. 33.3.1). As previously, J+(S)= U J+(q), and 9J+(S) is the boundary 
. . ges 

of this set, where only points of the open set Q come into question as boundary points (and 


not points of 02 in the sense of the embedding of Q in the R,,). The statement that u is a 
C@-function in J+(S) means that w is differentiable up to arbitrary order in the interior of 
J*(S) and that all of its derivatives can be continued continuously to the boundary of J+(S). 


Remark 2. The assumptions made with respect to S are natural, especially the requirement 
that S be space-like. If 2=M is the Minkowskian space, and if P= 0 and S={x | =i. 
then the above theorem in essence coincides with the existence and uniqueness proposition 
- of Theorem 19.3.5/2. Our aim is to generalize the explicit construction of the solution as de- 
scribed in that latter theorem to our present case. 


33.3.4. The Cauchy Problem: Representation 


We make the same assumptions as in Subsec. 33.3.3.: Let Q be a causal do- 
main in the space-time M, which is identical with its local chart (Fig. 33.25). Let 
S again be a past-compact space-like 3-dimensional hypersurface in Q, with 
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eJ*(S)=S8. For péeJ*(S), p¢S, we introduce the following notations: 
Jet (5), C,=C (pnd *(s), 
8,=JI-(p)S, o,=C-(p)NS. 


Then C,US, =D, (four-dimensional), and 6, =0S, (three-dimensional). o, isa 
»-dimensional closed space-like compact submanifold. By Theorem 33.1.2/1 and 
Remark 33.1.2/1, there exist exactly two characteristic surfaces that contain 
og, ; one of them is C~(p), the other is denoted by 7. Let « = u(q) be the invariant 
measure referred to in Theorem 32.1.4. On C,, let wr=pr(q) be the Leray form 
mentioned in Subsec. 33.3.2. Finally, let the 2-form de, be the invariant meas- 
ure on ¢,, where the metric in a, is induced by the metric in M. If 7 is a contra- 
variant vector, then let *7 be the 3-form stated in Theorem 32.2.2. In the sense 
ol oulbsets o2-2.5., let (*n)s_, be the restriction of *7 to S,. 


Dilatation factor: According to Subsec. 33.1.3, 7, can be generated by the null 
geodesics which pass through o, (Fig. 33.26). Let Eq) be the contravariant 
tangent vector to these null geodesics, which is normalized by Zi@)(VF(p, = 
= —1(¥V isapplied with respect to q¢; see also Subsec. 33.3.2.). ['(p, g) =e cuts a 
2-dimensional subset out of 7. If w is an open 2-dimensional subsct of 
Gp, thena set w, is cut out of 7,1 {P(p, =e} by the null geodesics which gener- 
ate 7, and pass through w. If da; is the invariant measure on 7,19 {(/'(p, q) =}; 
then on a, there exists a function © (dilatation factor) such that 


ae A i 7 ee 
lim a ( faa; — | ds, = - | Odo, , 


Iep,gi=e 


We 


oT 


ao: Fig. 33.26 


All symbols used in the following theorem shall have the meaning indicated 
above. P is the general wave operator of (32.3.4/5). U=U(p,q) and Vt= 
=\'+(p, 7) have the same meaning as in Theorem 33.3.2 and are regarded as 
functions of q. 
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Theorem. If u is differentiable on Q up to arbitrary order, and if pe J*(S), pé 8, 
then u(p) can be represented as 


u(p)=u(p) + up) + up) , 


cl il 
up) = — i U Puurt 5 { PIS 
Qa 27 
Cy Dp 


1 
Up) = 5 i (*7)s,, Where = {y*}, 
27 - 
Sy 
nt = Vtg 7 uw) — ug" 7 Vt)pt+ ak V +4 > 


1 
UuO(p) = ae f [205*(7 uj, +OUu tab, Uu+ Vtul de, . 


°p 


Remark 1. Tf one substitutes f for Pu and the given Cauchy data for u and Vu on 8, then 
one obtains a representation of the unique solution « of the Cauchy problem of Theorem 
33.3.3. Under the conditions of Theorem 33.3.2, with /€D+(Q), one in essence obtains 
(33.3.2/1). The above theorem (being interpreted as a solution of the Cauchy problem) also 
generalizes Theorem 19.3.5/2. In that case Q=M is the Minkowskian sone, JP = (OL, Yo =), 
U=i, a’=0, and u2)p) =0. 

Remark 2. For the determination of u(p) one has to know Pu in D, as well as the Cauchy 
data on S,. If V* is not identically zero, then the knowledge of the Cauchy data on a, will 
not suffice (otherwise one has to do with the Huyghenian property mentioned in Subsec. 
19.3.4.). Thus living in a curved space-time will not be as pleasant as in the Euclidean space. 
In the sense of the interpretation in Subsec. 19.3.4., a sound wave in a curved space-time 
will never die out completely. 


33.4. Tensor Wave Equations 


33.4.1. Definitions 


So far we considered the wave operator P of (32.3.4/5) in D’(M). In this sec- 
tion, too, M shall always be a space-time (i.e., a 4-dimensional Lorentz-metric 
space). Our previous investigations and many of the results can be transferred to 
tensor distributions in the sense of Def. 32.3.2 and to corresponding tensor wave 
operators. As tensor indices can be raised or lowered at choice, we shall confine 
ourselves to purely covariant and purely contravariant tensor distributions. The 
covariant derivative V and the coderivative 6 shall have the same meaning as in 
Def. 32.3.3/2. 


Definition. Let M be a space-time, and let m=0, 1, 2, ... For TED Om M) we 
define OT = —6(g* 7 T);) (tensor wave operator). 


Remark 1. From 7'¢ D’©:™( J) it follows that 7 Z'€ p/m + DM), 97 7), €D'™( M1), and 

OT = —6 (9g 7 7),)€D’O.™(M). If m=0, ie., if T EDM) =D‘ M), then O7' is the 
wave operator of Def. 32.3.4. Naturally the above definition can be immediately extended 
to an arbitrary n-space-time. 


Remark 2. Tf 7={7 Bese Rigg (©)F is a tensor with components differentiable up to arbitrary 
order (we recall that we always speak of tensors rather than, more precisely, of tensor 
fields), then 

(ID) i ky = GOD by digg 8) =I 


milk « 
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Remark 38. The definition is also applicable if T €D’)(J/). If, for instance, g = {p* thm} € 
€D(™,9)( M), then, by analogy with Remark 2, 

( Cg) iF = git htm y, 7 (1) 

Pieced 
Lemma. Let {6 ! ees be a tensor of type (m, m) with components differentiable 
up to arbitrary order. If T={T yr, }€ DOM), and if p= {p lm) ¢ Domo M), 
then 
Pyeel S00 oes 

CeO eee mnie) = 2 (bee yp }). (2) 

Remark 4. With the use of (1), the lemma can be derived from Def. 32.3.3/2. Here 
Pape ; 2 are 

te Mg Beg UT oot) ED’Om(M) is the tensor distribution whose components are 
OO aE ae By analogy, one has {BT tm, is ptm} € DOO) M). 
Remark 5. By analogy with our previous approach, for T €D’O.™)(M) we set 

POT NN eG Aicrs Eee (3) 
The lemma then shows that for g€.D@)( Af) the adjoint operator P* can be calculated from 

P*y a | p ae (Cy era 1 rr} : (4) 
Then (2) reads (PT)(y)=7(P*q). Here the co- and contravariant components are inter- 
changed when P is replaced by P*. 
Remark 6. By analogy with (32.3.4/5), it is possible to generalize PT of (3) by setting 

PT=O0T+ (OMe, a V Per. tt + RL a 


Hoye hs i eee €D’\0.m)( MM). Here a and 6 are tensors with components differentiable up to 


arbitrary order. But (3) will suffice for treating Maxwell’s equations in space-times. There- 
fore in the sequel we shall confine ourselves to this case. 


33.4.2. Fundamental Solutions 


The operators P and P* shall have the same meaning as in Subsec. 33.4.1., 
formulas (3) and (4). By analogy with Def. 33.2.1 and Theorem 33.2.4, the 
problem is to find fundamental solutions. 

Definition. Let 2 be a connected open set in the space-time M. If q€Q, then Gy= 
(Gp is called a fundamental solution of Pin Q if Gar = Cs € D’O.m) M) 
and 


Tg oee8 Peel T 400? 
(2G Voy=G,- “(hb y)—p “(a) 
for p€ D!™)(Q) and for every fixed m-tuple of indices, 14, +5 Tm Further, Chae 
with the m-tuple of indices 84, ...) Sm being fixed, vs supposed to transform as a 
contravariant tensor with respect to the variable q and the indices 74, «+, Vm 
Remark 1. For the scalar case m=0 this coincides with Def. 33.2.1. 


Bi-tensors: Let PMG AGB q)} be a system of functions on Q>*Q2 (or on 


MxM). Thent is called a bi-tensor (more precisely, a bi-tensor ficld) if, for fixed 
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Ty oeeT Us oct . . . 
PONG Fis Tae | pe is a UD q)} is a covariant tensor (with respect to 


q and 8), ..., 8m) and if t,,.. (¢)= ee q\\, tom timed and ee. 16 a 
contravariant tensor (with respect to p and 74, ..., 7m). A corresponding defini- 
tion is also possible in cases where the numbers of 7- and s-indices are not equal, 
or when mixed-variant tensors occur instead of co- and contravariant ones. G, is 
the distribution variant of a bi-tensor. 


Theorem. /f 2 ts a causal domain in the space-time M, then, in the sense of the 
above definition, there exists exactly one fundamental solution G, of P in 2, such 
that supp G,cJ*(q) for g¢Q2. Denoting this fundamental solution by GZ, one can 
represent it as 

ioe Pye? T poo? 

Detain Goa) ae (Gnd) (1) 


G Spo8yy — "8 [+S 9 eed 
Here 64,(f) has the same meaning as in Theorem 33.2.3. Further, Coe q)} 
ee Le ®, q)} are bi-tensors, and fa Ds q) ws differentiable wp to 
arbitrary order on 2X Q. If At has the same meaning as in Theorem 33.2.4, then 
supp = oe Aad VS ee as differentiable on At up to arbitrary order. 


and 


Remark 2. supp G,cJ*(q) means that supp Gg ho F+(q) (regarded as a covariant tensor 
distribution) holds for every m-tuple of indices 7, .:., 7m. The theorem is the analogue to 
Theorem 33.2.4. 


33.4.3. Solutions of Pu=7 


Theorem 33.4.2 is the analogue to Theorem 33.2.4. This makes it possible to 
transfer the theory of solutions presented in Sec. 33.3. for scalar wave operators 
to the case of tensor wave operators. This applies to both the analogue to Theo- 
rem 33.3.2 and the Cauchy problems described in Subsces. 33.3.3. and 33.3.4. 
Here we shall formulate the analogue to Theorem 33.3.2 and dispense with a 
treatment of Cauchy problems. We again consider a causal domain Q in a 
space-time M. Further, P shall have the same ineaning as in (33.4.1/3). 


Theorem. Jf f€D'”(Q) has a past-compaet support, then Pu =f has eractly one 
SOLULTON U = {Us 4 .s90 €D'\™)(Q) with past-compact support. This solution is given by 
up) = Gg (9) =fIGy EP "™(p)}) for pe DI™(Q) (1) 


and supp ucd ‘(supp f). Lf, om addition, the components of f are differentiable on Q 
up to arbitrary order, then the components of u in (1) are also differentiable on Q Up 
to arbitrary order, and 


1 "Past ' 
yt (P)= Z| TEMPO) fryearg (@) r(0) (2) 
C-(p) 


1 en, 
+e f VOID D frpergld) (0) - 


Remark. The theorem generalizes Theorem 33.3.2. Past-compact sets have been considered 
in Subsec. 33.3.1. So f belongs to the tensor variant of D’+(Q) as per Def. 33.3.1(c). Further, 


+ : ° * . 
Gq and the functions in (2) have the same meaning as in Theorem 33.4.2. To prove that 


Be 33.5. The Maxwell Equations SOS 


oO gg 


u€D'™ (QO), one has to verify that the correspondence pe D™ %Q Q) = f(Gq (@) is a tensor 
distribution of D’@,™(Q). This is done in the same way as in aft. Batol o ene itis 
easily seen that u in (1) is a solution: for mp €.D(™")(Q) one has 


(Pu) (p) = ul P*y) = f(Gq(P*)) = 19) 
ef. Lemma 33.4.1, Def. 33.4.2. and Theorem 33.4.2. 


33.5. The Maxwell Equations 


33.5.1. Definition 


As usual, a 4-dimensional Lorentz-metric space will be called a space-time. 


Definition. Let M be a space-time, and let o (current density) be a given vector whose 
components are differentiable on M up to arbitrary order. Desired a 2-form F 
(electromagnetic field) which satisfies 


dF=0 and 6F=co0 (Maxwell equations) , (iy 
and whose components are differentiable on M up to arbitrary order. 


Remark 1. As a 2-form, / ={/;7} is a covariant tensor which, by Def. 32.1.2 and Lemma 


32.1.2. satisfies the (necessary and sufficient) condition that F,,;= —F,. By Lemma 32.1.3 
the rast equation in (1) then means that the 3-form 
AF = Fy mdz™ Adz* Adz'=0. (2) 


In view of index shifting we do not distinguish between co- and eontravariant components 
of a tensor. In the second equation in (1) one has to interpret / as a contravariant or mixed- 
variant tensor. Then by Def. 32.3.3/1 one obtains 


(Oho, (3) 


Remark 2. In the above definition we have eonfined ourselves to vectors o and forms F 
which are differentiable up to arbitrary order. In our further treatment we will follow this 
line. But it is also possible to extend the Maxwell equations (1) to tensor distributions o and F’. 


.Remark 3. Poincare’s theorem in Subsec. 32.1.5. shows that locally dF =0 is equivalent 
with F—dd, where A is a veetor. Hence loeally the Maxwell equations (1) are identieal 
with 
Fyy=Agy—Are » (4) 
— (Vig| Oat —o* 4 (5) 
I9| 
This links our considerations with Subsee. 30.1.3. In the sense of Remark 30.1.3/3 one 
thinks of the Einstein-Maxwell field as being decoupled: the geometry of the space is deter- 


mined locally by heavy masses and globally by the world models considered, and an elec- 
tromagnetie field to be investigated docs not influence this geometry. 


33.5.2. Continuity Equation and Cauchy Data 
Lemma. The current density o of the Marwell equations (33.5.1/1) satisfies the 
continuity equation d0=0. 
Remark 1. The Lemma is derived from ##! = — F*, (33.5.1/5) and Def. 32.3.3/1: 
0= (Vig) FY) a=W 1gl of) e=Vig, oF p= —V igi 00 - 
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Cauchy problem: The Maxwell equations in the Minkowskian space were dealt 
with in Sec. 25.3. We shall attempt to translate the problem setting considered 
there to the case of arbitrary space-times. As previously we shall take the exam- 
ple of the wave equation as a guide to our approach (cf. Subsecs. 33.3.3. and 
33.3.4.). As in these subsections, let 2 be a causal domain in a space-time M, and 
let S be a 3-dimensional past-compact space-like hypersurface in Q, with J *(S) = 
=S (Fig. 33.27). For pe J*(S8), p¢S, let D, and 8S, have the same meaning as in 
Subsec. 33.3.4. The classical theory of Sec. 25.3. as well as physical reasons sug- 
gest the following question: if o is given in Q, and if F is given on S, will then 
the Maxwell equations (33.5.1/1) have one and only one solution F in J*+(S) with 
initial data prescribed on S? It would further be desirable that at the point p 
this solution F depends only on o in D, and on F onS, (the domains of depend- 
ence being analogous to those in Subsees. 19.3.4. and 33.3.4.). 


p 


Fig. 33.27 


Cauchy data: For the Cauchy problem to be posed reasonably, the following rela- 
tions must be satisfied on 2 in addition to the continuity equation 6c=0: 


(dF)(q)=0 and (8F)(q)=o(q) for ges. (1) 


For the discussion of these conditions we choose normal co-ordinates in the 
sense of Lemma 29.5.6/2 at the point g=0. Further let S be part of the plane 
{v4 =0} in a neighbourhood of 0. In view of (33.5.1/2) and (33.5.1/3), the relations 
(1) then reduce to 


Fievm(0)=0 and F*’,,(0)=0*(0) . (2) 


Since for the moment we assume that F is given only on S, only 2 of these 8 
equations will result in additional conditions for the initial data, namely 


#0) 0) sands 7,(0)—a4(0) Sas welllacmei =e (3) 
The remaining 6 equations read 

Pyv(0) = — Pean(0) —Fyia(O) ; 

F*",,(0) = 0%(0) — F%4,,(0) — F**,9(0) —F,3(0) , 


where a and b run from 1 to 3. If F* are known on S, then these equations make 
it possible to calculate the derivatives of F”’ with respect to x‘. As the deriva- 
tives of F*' with respect to x!, x2, x} are also known, this means that all the 
partial first-order derivatives of F"’ on S are known. By iteration it then follows 
from (4) that all the partial derivatives of F” on S can be determined. In the 
sequel we shall speak of Cauchy data for the Maxwell equations if on S there is 
given an antisymmetric field 7*"(q), that means, F'"'(q) = — F¥(q), which satisfies 
(1) (or (3) in the canonical form) and whose partial derivatives on S are all caleu- 
lated from F" by the above method. 


(4) 
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Remark 2. If one identifies F*! with the electric field strength © and the magnetic ficld 
strength ¥, as was done in Subsee. 24.3.3., and if o4= —@ (charge density), then (3) reads 
(div B)(0)=0 and (div ©)(0)=@(0). 


This corresponds with the conditions in Subsec. 25.3.2. 


Definition (Cauchy problem for Maxwell equations). Let 2 be a causal domain in the 
space-time M, and let S be a 3-dimensional past-compact space-like hypersurface in 
2, with eéJ*(S)=S. Let Cauchy data differentiable up to arbitrary order, F(q)= 
= {F*'(q)}, be given on S for the Maxwell equations. Further let o = {a*}, with 60 =0, 
be given in J*+(S), with components o* differentiable wp to arbitrary order. Desired a 
2-jorm F with components differentiable up to arbitrary order, which is a solution of 


di) gee. oF =o Win J*(S) 
and assumes the given initial values on S. 


Remark 3. This is the analogue to the Cauchy problem for the Maxwell equations in the 
Minkowskian space, as stated in Subsecs. 25.3.1. and 25.3.2. As was done there, a correspond- 
ing problem can also be formulated here on the basis of the theory of distributions. 


33.5.3. Gauge Condition and Four-Potential 


As has been stated in Remark 33.5.1/3, dF =0 is locally equivalent to F =dA, 
where the 1-form A is called a fonr-potential. The Maxwell equations of (33.5. 1/1) 
then reduce to 6dA =o. s 


Cauchy data: Consider the same situation as in Def. 33.5.2. Let again 2 be a 
causal domain in the space-time M, and let S have the properties stated there. 
Let F(q) be Cauchy data given for the Maxwell equations on S. Without loss of — 
generality we assume that S be part of the plane x*=0. Then the construction of 
A described in Remark 32.1.5 shows that, with the Cauchy data F'(q) being given 
on S, the components A, of the four-potential A as well as all partial derivatives 
of these components on S are known, too. If A; as well as all the partial deriva- 
tives of A, on S are calculated from the Cauchy data F(q) in this way, then we 
say that the Cauchy data for the fonr-potential A on S are given. 

Gauge condition. According to Remark 32.1.5/2 it is possible to replace A, 
with F=dA, by A+dB, where B is a function. Now it is not very difficult to 
see that B can be so chosen that (6A)(q) = —A*.,(g)=0 for géS (gauge condi- 
tion). From now on we shall always assume that A satisfies the gauge con- 
dition on S when the Cauchy data are given. 


Lemma. Written in component notation, the reduced Maxwell equation 604A =o 
reads 

o,=(0A), + (6A) n+ WA; , (1) 
where R', ts the Ricci tensor as per Def. 29.5.7. 
Remark 1. The term (6A) .,= —A!., isannoying. Without it, Eq. (1), that is, (0A), + As, 


would be a tensor wave opcrator of the form (33.4.1/3) applied to A ={Az}. The question is 
whether the libertics onc has in the choice of A can be utilized to neutralize the tern (04) x4. 


Definition (Cauchy problem for the four-potential A). Let Q be a causal domain 
in the space-time M, and let S be a 3-dimensional past-compact space-like hyper- 
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surface in Q, with @J*(S)=S. In J+(S), let a= {0} be given, with 60 =0 and with 
components differentiable up to arbitrary order. Further suppose that on S there 
are given Cauchy data differentiable wp to arbitrary order for A={A,}, which 
can be determined from corresponding Cauchy data F(q) for the Maxwell equations 
in the sense set out above. Suppose that the gauge condition (6A)(p) =0 be satisfied 
for pES. Desired are vectors A differentiable wp to arbitrary order, with 


G=—(OA), nA, “nd @A—05 in Aer (2) 
such that A, and A,.. assume the given initial values on S. 


Remark 2. (2) is the decoupling of (1). In the general case the system (2) is overdetermined. 
With arbitrary initial data being given on S, the first equation in (2), as a tensor wave 
equation, has a unique solution, The question is whether this solution also satisfies 0A =0. 
It is clear that this can be the case only under special conditions. 


Theorem. The Cauchy problem for the four-potential A (tn the sense of the above 
definition) has a unique solution A ={A,}. If peTt(S), pS, then A,(p) depends 
only on the values of a, in D, and on the values of F'™ in S >. 


Remark 3. D, and 8, have the same meaning as in Subsecs. 33.3.4. and 33.5.2. As was al- 
ready indicated in Remark 2, one first considers the Cauchy problem for the tensor wave 
equation 

(OA)p + BY As =oy « 


The tensor analogue to Theorem 33.3.3 shows that this problem has a unique solution. This 
solution A; also depends on o; and F"” in the form stated above. The proof that this solution 
automatically satisfies 6A =0 in J*(S) is based on the gauge condition (6A)(g) =0 for gES8 
and 0¢=0 in J+(S) and on the specific choice of the Cauchy data for A. 


33.5.4. The Cauchy Problem for the Maxwell Equations 


Theorem. The Cauchy problem for the Maxwell equations in the sense of Def. 
33.5.2 has a unique solution F={F yy}. If peIt(S), pes, then Fii(p) depends 
only on the values of ¢, in D, and on the values of F,, in S,. 


Remark 1. D, and S, again have the same meaning as in Subsecs. 33.3.4. and 33.5.2. The 
theorem is reduced to Theorem 33.5.3 by setting #=dA. Then df =0., The right-hand side 
of (33.5. 3/1) is identical with 0/. From this, together with 64 =0, it then follows that 6F =a. 
This proves the existence of the solution and the above-mentioned dependence of F,)(p) on 
the initial data. The uniqueness proposition is also reduced to Theorem 33.5.3. 


Remark 2. The theorem is the analogue to Theorem 25.3.2/1. As in that case, it should also 
be possible here to state the solutions explicitly and to extend the considerations to distri- 
butions. 


eee 34.1. Local Mappings 403 


34. Singwarity Theory 


34.1. Loeal Mappings 
34.1.1. Germs of Mappings, the Ideal a2() 


Let C»(R,, R,) denote the class of all mappings y=f(r) of R, into R; which 
are differentiable up to arbitrary order. All the functions and mappings to be 
considered in this chapter are real-valued. 


Definition 1. Two mappings of C°(R,, R:) are said to be equivalent if they coin- 
cide in a suitable neighbourhood of the orrgin (in R,). The equivalence classes 
with respect to this equivalence relation are called map-germs (or simply germs). 
The class of all germs is denoted by e(n, l). 


Remark 1. The above equivalence relation {/~g has the usual properties: 1. f~f, 2. f~g im- 
plies g~f, and 3. f~g and g~h implies /~h. Henee the formation of equivalenee classes is 
meaningful. From now on we shall no longer distinguish between equivalence classes and 
their representatives. Hence we speak of germs {/€C™(£,,, &,) ete. (This convention is analo- 
gous to the notation f¢L,(#,,). In this case, too, f is identified with the equivalenee class of 
those functions whieh coincide with / almost everywhere.) All statements to be made in this 
chapter are loeal ones (if not explicitly stated otherwise) and refer to germs. 


Remark 2. Instead of considering functions differentiable up to arbitrary order, we might 
also consider continuous, differentiable, or analytic functions, and form the corresponding 
equivalence elasses. 


Definition 2. Let e(r)=e(n, 1), 
m*(n) = {f | fee(n), (D7f\(0)=0 for lalsk—1} 
for k=1, 2, 3,...and m(n)=m!(n). 


Remark 3. In this definition we have already made use of the above convention to identify 
equivalene classes with their representatives. Thus the elements of e(n) are function germs 
(or germs in the sense of the above terminology). 


Theorem. e(n) is a commutative ring with an identity element and a unique maximal 
ideal. This maximal ideal is m(n), and is generated by x4, ..-, Xn: 


Remark 4. Addition and multiplication of germs arc carried out point by point, being inde- 
pendent of the choice of the representatives. Since f-g=g-/f, e(n) isa commutative ring. 
The identity element is f(z) =1. Further, h(n) is ealled an ideal if fEA(n) and g€e(n) implies 
fg€h(n), which is true for h(n)=m(n). An ideal h(n) is called maximal if h(n) does not coin- 
eide with e(n) and if there is no ideal different from e(n) which is properly greater than ’(2). 
The theorem asserts that A(n)=m(n) is the only maximal ideal in e(n). The assertion that 
m(n) is generated by 2, ...,%, means that /€m(n) can be represented as 


f(a) =5 xpt,-(%), where «,(x) €e(n). 
r=1 


Remark 5. Correspondingly, it can be shown that m*(n) is an ideal in e(n) which is generated 


n 
by the polynomials 21! ... ain, where «= (a4, --., &») is a multiple index, |a| = 4 pp =lNe 
‘3 
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34.1.2. Finitely Determined Germs 


Definition. (a) The germs fEm(n) and gem(n) are called equivalent if there exists 


Ol ee 
a germ y= (xr) Een, rn) such that p(0) =90, en (0) +0 and f(x) =g(y(x)). 
TE NS 
ey n) ws called k-deternined, with k=1, 2, 3,...,2f every germ gem(n) 
u a (D*g)(0) = (D*f)(0) for lal =k is equivalent to f. 
(e) fEm(n) is called finitely determined if there exists a number k such that f ts 
h-determined. 


Remark 1L. All these statements arc to be understood locally, including the representation 


f(x) =g(p(2)). As usual, _ =e is the Jacobian of y=y(x)=9(x) as defined in Subsec. 
: i ee IY 4s -++ Yn) 
i) 5s jRe : =i) 5 

8.2.1. The requirements that y(0) =0 and Ec 
to one and differentiable up to arbitrary order, of a neighbourhood of 0€ R, onto a neigh- 
bourhood of 0 € R, (cf. Subsec. 8.2.2.). Such a mapping can also be interpreted as introducing 
new, curvilinear co-ordinates in a neighbourhood of 0€ #,. The co-ordinate surfaces are then 
yx(“) =const. In this sense, f and g are said to be equivalent if f can be obtained from g by 
reparametrization. 


(0) +0 mean that g(x) is a mapping, one 


og 
Oxy (0) 
f(x) =«,. From this normal form one can easily qenee that every two germs g and h with 
gem(n), hEm(n), gém(n), and hém2(n) are equivalent. Hence it follows in particular that 
gem(n) with g¢m(n) is 1-determined. Hence (b) and (c) are of interest mainly for singular 
germs f€m2(n). 


n 
Remark 2. Tf gém(n) but g¢m2(n), that means, if 4 >0, then g is equivalent to 


Remark $ (examples). If n=2, then f(x, y) =x? is not finitely determined. This can be seen 
by comparing the zero manifolds of f(a, y) ee and f(x, y) =2?~y?4. For n=2, f(x, y) =22+y3 
is an example of a 3-determined germ. 


34.1.3. Criteria for Finitely Determined Germs 


it ecs) then ler 


g w r,) 
Ge={a! geet SCHISM AES 2 2 ae 2) eA} (1) 


; ap 
denote the ideal generated by in e(n). We recall the previous convention 
Xk ; 


that we do not distinguish between germs and their representatives. So the 
above representation for g(r) needs only to hold locally in a neighbourhood of 
OC Ry. 


0 
Remark 1. For the example f(x, 2) =x} +2} of Remark 34.1,2/3 one has PY ay, 2). 
Theorem. f€ e 7s pe y determined if and only tf there exists a natural number 
A such that m*(n =) J 
Remark 2. The theorem is not trivial. But the fact that such inclusions might play some role 


can be seen as follows. If g €m(n) is expanded in a Taylor series at 0, then the initial terms of 
the Taylor series for f and g are compared with each other on the basis of Def. 34.1.2(b). 
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while the remainder of the Taylor series for g. which belongs to m*(n), is captured by f accord- 
ing to the theorem. 


Remark 8. The theorem is derived from the following two partial assertions, which give a 


sharper formulation as well. (a) fis k-determined if m**!'(x) cman. (b) Tf fis k-deter- 
; n) c OL 
mined, then m*+1(2) minh, Here m(a)() is the set of all products of elements of 
Ox /af\ Ox 


of 
: 2 ; = anes Tf) ee > 
m(n) and G.): by analogy with m2?(n) Gee 

Definition. /f f€m2(n), then cod f (codimension of f) ts the minimum number 
of germs gi€m(n) withl=1,..., N, such that every germ g € m(n) can be represented as 

ANY n of 
g=2, agit > a(x) =—, areal, ap(x)€e(n) . (2) 

St K Oxr 


If N=cod f, then gq, ..., gx ts called a basis of m(n) with respect to a ; 
bs 


Remark 4. Thus in the formal notation one has cod f=dima, min) (2), and gj, -... 9x 


0 
with V =cod f is a basis of minor). Note that the a@)’s in (2) are real numbers. 


Remark 5. In the definition we implicitly assumed that there exists a natural number 
: : : 7) 

with the above-mentioned properties. If m(n) =(*), then we set cod {f=0. If there is no 

number N =0, 1, 2,... in the sense of the definition, then we set cod f=. 

Remark 6. For n =2 one has cod (x2y) =~, that is dimp, m(2)/<xy, 22) =, because the germs 

y, y2, y, ... cannot be captured by finitely many adjunctions of suitable germs g,;. For n=2 

one has cod (x3 + y°) =3, that is dimp, m(2)/(x?, y2)=3, where x, y, ry is a basis. 


34.2. Stability 


34.2.1. Definitions 


Definition 1. (a) Global diffeomorphism in Rp. A one-to-one C*-mapping h(x) of 
R, onto Ry ts called a diffeomorphism. 

(b) Local diffeomorphism in Ry. A C=-mapping h(x) of Ry ito R, ws called a 
local diffeomorphism at x9€ Ry if « neighbourhood of x? vs mapped one to one onto 
a neighbourhood h(x) Rp. 

(c) Diffeomorphism on manifolds. A homeomorphic mupping h of an (n-dimen- 
sional orientable) C*-manifold M onto itself vs called a diffeomorphism if h(x) 
ts a Om-mapping in every local chart. 


Remark 1. The term “mapping of 2, onto #,”” means that every point of the £#,, is an image 
point (hence the range of such a mapping is the whole £,,). If this is not ensured, then we 
speak of a ‘‘mapping of /,, into R,. By a Ce-mapping we mean a mapping h(x) =(Ai(2). --- 
h,(x)) whose components are all differentiable up to arbitrary order. Part (b) ineans that 
p(x) =h(a + 2°) —h(x) Ee(n, n) 18 a mapping with the properties stated in Def. 34.1.2. 
Manifolds Af as well as homeomorphic and diffeomorphic mappings on manifolds have been 
defined in Subsec. 29.1.2. 
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Remark 2. If /(x) = (/,(2), ..., f)(x)) isa mapping of R, into R;, then from now on we shall set 
l 


ey Ife(x)| and (D*f) (x) =(D*f,(x), .... D%fr(x)) . 
=| 
Definition 2. (a) A O*-mapping fx) of Ry into R; is said to be stable if there exist 
« positive number e and a natural number k such that, for every C°-mapping 
g(x) of Ry trnto Ry with 

sup > |D%f(r)— D%q(x)|<e 


TER, lalsk 


there exist a diffeomorphism h, in Ry und a diffeomorphism hy in R, for which 
the diagram 


eae 
hy} { hy (1) 
Mp a Ry 


7s commutative. 

(b) A C=-mapping f(x) of Ry into R, is said to be stable at the point x € Ry if 
there extsts u positive number ry, such that, for every ball K,={x|lx—2x | <r}, 
where O<r<ro, there extst a positive number = e(r) and a nutural number k=k(r) 
with the following property: for every C”-mapping g(x) of Ry into R; with 

supe 2 Wi) D79(a)| =e 

tEK, |e sk 
there exist a local diffeomorphism h, at x°€ Ry and «a local d iffeomorphism hs at 
f(x) eli, such that the diagram (1) is locally commutative. 


Remark 3. The commutativity of (1) means that g(hy(%)) =ho(f(x)) for e€ R,, which is also 
written as goh,=hyof. h,~! isalsoa diffeomorphism if his. so that (1) can also be written as 
g=h,ofohy~'. In other words, gy is obtained from f by reparametrization (introduction of 
curvilinear co-ordinates) in R, and R). 


Remark 4. (b) is the local variant of (a). The family of balls AK, with 7-0 is necessary be- 
cause it is desirable to use only local properties of f at x9 (or of comparison mappings g). The 
local variant of the diagram (1) is as follows. The local diffeomorphism hy maps a neighbour- 
hood U of x, with UCK,, onto hy(U) CK, f and g map U and h,(U) into sets in R (Fig. 
34.1). Finally, ho maps f(U) diffeomorphically onto ho(f(U)), and the above-mentioned rela- 
tions hold true (Fig. 34.2). Thus g is obtained locally from f by reparametrization. 


Fig. 34.1 


c P Ga 
Cad, 3s 
an 


GM =h(fU)) Fig. 34.2 
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Remark 5. The above concept of local stability suggests itself on the one hand, but it is also 
rather general on the other. We shall consider several specializations. One spceial case is 
Def. 34.1.2(a)."There one has 1=1, hy(%) =@(a), with g(0)=0, and Ay is the identity. 
Remark 6. The stability of C°-mappings between manifolds can be defined in an obvious 
way. 
Remark 7. Regular mappings that are stable will be considered in the next subseetion. Thus 
for the moment we confine ourselves to some singwar mappings whieh will play an essential 
role later on. (The terms “‘regular” and “singular” will be explained later.) (a) f(#) =a is a 
stable mapping of R, into R, (where n=l=1). (b) (Whitney's cuspidal point of the second 
kind). 

ula yy=zy—e,  eley=ye (2) 
is a stable mapping of Ry into Ry (n=1=2). (e) f(x) =a is a mapping of A, into R, that is 
not stable (xn =[=1). f(x) is not even locally stable at 2° =0. From the assumption of loeal 
stability at 0 it would follow that there exist two local diffeomorphisms h, and hy such that 
gohy=hoof for g(x) =x3 +ex. The chain rule then shows that (h20/)’ has exaetly one zero in 
a neighbourhood of 0, whereas (g 0/,)’ with e<0 has two zeros. This is a eontradietion. 


34.9.2, Tmmersions and Submersions 


If f(x) =(f,(x), «5s fr(x)) is a C*-mapping of R, into FR), then let 
aden 
Gx, xn 


an eh 
Ce, Ce, 
be the associated Jacobian matrix. If n=/ and det (/f)(x°)+0, then it follows 
from Theorem 8.2.2/1 that f is a local diffeomorphism at x°. Hence it is at least 
plausible that the rank of /f is of essential interest in the investigation of local 
mapping properties. : 
Definition. (a) r°¢ R, ts called a regular point of f if the rank of (J f)(xo) ts maxi- 
mum, t.e., of rank (Jf)(x°)=min (n, £). 

(b) f ts called an immersion if nSland f 1s regular at every point xe Ry. 

(ec) f ts called a submersion ifnZl and f is regular at every point x€ Rp. 


(J f(x) = (x) 


Remark 1. The definition is of a local nature. By means of loeal eharts it can immediately be 
extended to mappings between manifolds in the sense of Def. 29.1.2. 


Remark 2. A elosed, smooth curve without double points ean be immersed in very complex- 
structured sets in the plane (Fig. 34.3) (here n=1 and 1=2 in the sense of the above defini- 
tion and of Remark 1). On the other hand such a eurve cannot be immersed in R;. The global 
propertics of immersions may be very complicated. The definition (as well as its extension to 
manifolds) covers regularity only on a local scale. 
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Theorem 1. Jf x° is a regular point of f in the sense of the above Definition, then f 
ts stable at the point x in the sense of Definition 34.2.1/2(b). 


Remark 3. For =I this is again casily dedueed from Theorem 8.2.2/1. 


Remark 4. The converse of the theorem is not true, as is shown by the examples given in 
Remark 34.2.1/7. 
Remark 5. The theorem ean be strengthened substantially. If 2° is a regular point of /, and if 
n=l, then there exists a local diffeomorphism g at /(z°)€ R; such that, in a neighbourhood U 
of x°, 

(gof)(x) =(%, sory Uns OO) te CG, OD) 
(loeal normal form for immersions). If 9 is a regular point of f, and if x =/, then there exists a 
local diffeomorphism hf at «9¢€ &, sueh that, in a neighbourhood U of 2°, 


(Gi OLN @)S (Gain fo ox Bp) Oe PEW (2) 


(local normal form for submersions). The theorem is a simple implication of (1) and (2). 


The space C~(R,, R,): As in Subsec. 34.1.1., O-( Ry, Ry) consists of all mappings 
of R, into R, that are differentiable up to arbitrary order. Elements of 
O (Ry, Rj) ave added and multiplied by real numbers in a point-by-point manner. 
One then obtains a linear space. C-(R,, Rz) becomes a Hausdorff space as 
defined in Subsec. 29.1.1. if the open sets are chosen to be arbitrary unions of 

{g| sup >) {D*f(x)—D*g(x)| <e} . (3) 

TERY {el|ssk 

ere 0. /€C-( iy, bi), goC "tn, Jy), and h— Ose (cl hema s4:2.1/2)) 
Such spaces are also called locally convex. If in the sequel we speak of sets dense 
in C»(R,, R,) then this will always be meant in the sense of the above topology. 
If N and / are any two manifolds of the dimensions n and /, respectively, then 
one can define C7(N, L) in a quite analogous way, and make it a Hausdorff 
space. 


Theorem 2. /f /=2n, then the class of all immersions of Ry in Ry is dense in 


CA fins Ri) : 


Remark 6. Theorem 2 remains true if &, and A; are replaced by, respectively, any n- and 
i-dimensional manifold. In any case the immersions form an open set. 

Remark 7. The restriction that /=2n is annoying. Later on we shall deal with problems of 
denseness for two cases whieh are not ineluded in the above theorem: (a) /=1, » arbitrary 
(Morse’s theory), (b) 7 =!=2 (Whitney’s theory). Tt will then turn out that in these cases 
sets dense inC~(R,,, 7} ean be eonstructed only if one also admits special singular mappings. 


34.2.3. Global Theorems 


The terms “‘stable” and “generic” are fundamental for the singularity theory. 
Local and global stability has been dealt with in detail in the preceding sub- 
sections. The term “generic”’ is not so clearly defined. We shall use it here in 
the following sense: desired are sets in C°(A,, R;) (or, more generally, in 
C~(N, L), where N is an n-dimensional, 1 an /-dimensional manifold in the 
sense of Def. 29.1.2) with two properties: (1) These sets shall be dense in 
C? (Ry, Ri) (orin C*(N, L)). (2) The elements of these sets shall have as many good 
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properties as possible. The meaning is clear: it is not desirable to struggle with 
pathological phenomena of isolated mappings if in an arbitrarily close neigh- 
bourhood (with respect to the topology of C@(R,, R)) or C°(N, L)) there are 
mappings with much better properties. For the applications of the theory of 
singularities to problems of physics and biology, this point of view is reasonable. 
The two theorems stated in Subsec. 34.2.2. show that locally stable mappings 
are generic, provided that /=2n. It would be desirable to have analogous propo- 
sitions on global stability and criteria for global stability. The following example 
shows that the situation on the global scale may be very complex (Fig. 34.4). 
Let n=1 and /=2 in the sense set out above. A closed C*-curve N without 
double points can be immersed by f in L= R, in the manner indicated. Slight 
modifications g of f (in the sense of C~(N, L)) may lead to qualitatively new 
situations. The figures show that /, though being locally stable, is not globally 
stable, because double points, points of contact etc. are invariant under diffeo- 
morphic mappings. This will raise the question for one-to-one immersions and 
whether they are generic and globally stable. 


Z 
IRS 


N 
Fig. 34.4 


Theorem. Let N be an n-dimenstonal, and Lan l-dimensional manifold. Further 
let lL2n. 

(a) If N is compact, then every one-to-one immersion vs globally stable. 

(b) If N 7s compact, and if 1=2n +1, then fEC(N, L) is globally stable if and 
only tf f is a one-to-one immersion. ’ : 

(ec) If 1=2n+1, then the class of all one-to-one immersions of N in L 1s dense 
pC (hy 2). 
Remark 1. We shall mainly be concerned with local properties. Global propositions like the 
above theorem will only play a minor role in the sequel. 


Remark 2. In part (c) it is not required that NV is compact. In particular it is thus possible 
that V=F, and L= R;. However, if N is compact in addition, then the one-to-one Immer- 
sions form an open set in C@(N, L). 


34.3. Singularities and Morse Functions 


34.3.1. Sineularities 


In Def. 34.2.2(a) we stated when x€ Ry is a regular point of feC°(Rn, Mi). 
f and Jf shall now have the same meaning as m that definition. 


Definition 1. 7°¢ R,, is called a singular point of fEC@( Ry, Ri) if the rank of 
(Jf)(x) 7s not maximal, that is, if rank (J f)(x°) < min (n, 2). 
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EE EE a eee 
Remark 1. The definition can be immediately trausferred to fEC*(N, £), where N is an 
n-dimensional, and ZL an /-dimensional C~-manifold. 

Remark 2. If {¢C"(R,, R,) is a function (i.e., /=1 in the above definition), then x°¢ R, isa 


0 
singular point of f(#) if and only if oF am =()) stot (R= Ih, coag Hs 
f 


Remark 8. For the examples f(a) =x? and f(v) =z given in Remark 34.2.1/7, x =0 is a singu- 
lar point (here n=/=1). For the example (34.2.1/2) (where n=1=2) one has 


hte =(4" ae 


Hence a point (x, y)€ Ry is singular if and only if it lies on the parabola y =322. 
Definition 2. 7f fcCv(R,, R,) (%.e.,=1 tn the sense of Def. 1), then xR, ws 
ee el ee 
called a non-degenerate critical point of f if x9 is a singular point (i, af Eas) = 
k 


62} n 
joel, und det ( (220; 


62,02)] 1 =1 


is also called Hessian matrix. 


2 n 
Remark 4. ( Al 


Ox,OL 1] k1=1 
Remark 5. For /(z)=22, x=0 is a non-degenerate critical point. For f(z)=23, x=0 is a 
“degenerate” critical point (here n=1). 


34.3.2. Morse Functions 


Definition. (¢C°(R,, Ry) ts called a Morse function if every point x R,, ts ether a 
regular point or a non-degenerate critical point of f. 


Remark 1. /(z)=« and f(x) =? are Morse functions, whereas f(x) =a? (with »=1) is not. 


Theorem. (a) The Morse functions are dense in C*(R,, Rj). 
(b) A Morse function ts stable at every point x€ R,. 


Remark 2, Denseness in C*(R,,, R;) is to be understood in the sense of the topology defined in 
Subsec. 34.2.2. Local stability has been defined in Def. 34.2.1/2(b). For regular points, part 
(b) is already known; ef. Theorem 34.2.2/1. 


Remark 3. The property of being a Morse function is generic in C( R,, R,) (in the sense of 
the terminology used in Subsec. 34.2.3.). Other than in Theorem 34.2.2/2 (mappings that 
consist of nothing but regular points are generic in the case ]=2n), now one has to admit 
certain singular points in addition to regular ones in order to get functions with generic 
properties. It is clear that it will be attempted to keep these singularities as innocuous as 
possible. This leads to the concept of the non-degenerate critical point of Def. 34.3.1/2. 


Remark 4 (normal forms). If #¢€R, is a regular point of f¢C(R,. R,), then, by Remark 
34.2.2/5, in a neighbourhood of 2° it is possible to obtain the normal form (foh)(z)=a,, 
where A isa local diffeomorphism. If z°€ #,, isa non-degenerate critical point of f/€C°(R,, RP). 
then in a neighbourhood of x° it is possible to obtain the normal form (foh)(x)=a}+ 
Ebon +a; —ajp,,;—...—22, where k is a suitable number, /:=0, ..., , and A isa local diffeomor- 
phism. 2 


Remark 5 (global version of the theorem). f€C™(R,, R,) is globally stable if and only if f is a 
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Ee 
Morse function whose values at the singular points are pairwise different (this means that 


f(a!) + f(x?) if x! and x? are singular points of f and x!+2?), Since later on we shall mainly be 
interested in local effects, we have formulated this interesting proposition only as a remark. 


34.4. Mappings in the Plane 
34.4.1. Good and Excellent Mappings 


We consider mappings f€C™(f», Ro) of the plane into itself. Here we write 
j= (ulx, y). v(x, y). Let 


jOu OU 
ee Or oy _dudv dudv 
Ce = av bv | det (J f)(x, Uae eee 
‘Ox OY 


be the Jacobian matrix defined in Subsece. 34.2.2., and the Jacobian determinant, 
respectively. 


Definition 1. f€C*( Ry, Ry) rs called a good mapping tf 
O é 
ldet (JA(x, y+] = det (Jp(x, y) [+ | 5 et Ale 1) |=0 


at every point (x, y)E Ry. 


Lemma. [ff is a good mapping, then the set of singular points consists of C”-curves 
‘in By which do not intersect one another. 

Remark 1. By Def. 34.3.1/1, (z, y)€ Ry is a singular point of fEC~(R,, R,) if and only if 
det (Jf) (x, y) =0. Let y(t) = (y(t), yo(é)) be a smooth parametrization of a C=-curve of singular 


points of f. 
! 8 . 
Definition 2. (a) Let f be a good mapping, and let y(t) be a curve of singular points. 


p=ryto) ts-called a fold point if 


d 
qy flv slto)s yo{to)) +0 . 


p=rylto) ts called a cusp point if 


~~ 


oy 


d d 
qf ilto), yolty)) =O and de (7 1(ty), Yo(to)) £0 - 


(b) A mapping fEC?( Rs, Ry) ts said to be excellent if f is « good mapping and 
its singular points ure etther fold points or cusp points. 


Remark 2. By 0 we always mean 0€ Ao. It is easily verified that the definition of the fold 
points and cusp points is independent of the manner in which the curve y(¢) is parametrized. 


Remark 3. Cusp points are always isolated points. If p is a cusp point on y(é), then the adja- 
cent ares of the curve (t) in a neighbourhood of p consist of fold points alone (Fig. 34.5). 


i) 
party CY 
PX, Fig. 34.5 
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34.4.2. Normal Forms of Fold Points and Cusp Points - 


Theorem. Jf f=(u(x, y), v(x, y)) ts an excellent mapping, then on the local scale 
it 1s possible to obtain the following normal forms by reparametrization: 

(a) u=x, v=y, where (0, 0) ts a regular pornt. 

(b) w=2?, v=y, where (0, 0) is a fold point. 

(ec) u= —ayt+2", v=y, where (0, 0) is a cusp point. 


Remark 1. Here reparametrization means the local variant of (34.2.1/1), where hy and hy are 
local diffeomorphisms of the z,y-plane and the u,v-plane, respectively. The point to be in- 
vestigated is mapped onto (0, 0). 


Fold: The theorem suggests that the mappings in (b) and (c) should also be 
investigated globally. The mapping f=(u, v) with w=.2x? and v=y is called the 
fold. One has 

2x 


0 0 ; 
Me M=(G G)> det Mle M=Be, Fdet (Ale w=2- 


fis an excellent mapping. The points (x, y) € Ry with x +0 are regular. The points 
(0, y) are singular, being fold points. / folds the x,y-plane, mapping it onto the 
half-plane {w=0, v}. 


Cusp (Whitney’s cuspidal point of second kind, cf. Remark 34.2.1/7): The mapping 
f=(u, v) with «= —ry+x3 and v=y is called the cusp. One has 


= Bg 
Chiles n=( a ay det (J/)(«, y)=322—-y, 


«act (Jf\(x, y) = Gx, F det Jf/(a,y)=—-1. 


fis an excellent mapping. The singular points lie on the parabola y=322, which 
can be parametrized by y(t) = (t, 3@) (Fig. 34.6). Then one has f(y(t)) = ( — 288, 322) 
as well as 


d d2 


(y(é)) =(— 62, 6t) and qe flr) =(— 126, (OMe 


dt 


Fig. 34.6 


Hence the points with ¢+0 are fold points and the point with 1=0 (that means 
‘ Q)) is an (isolated) cusp point. The mapping is folded along the parabola 


= 3x". The inage of this parabola is the semicubical parabola (or Neil parabola, 
» 38 


or cuspidal point of second kind) w= tt ; 
/33 
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34.4.3. Whitney’s Theory 

Theorem. (a) The mapping fEC*(Rs, Ry) is stable at every point (a, y)e Ry tf 

and only vf f ts excellent. : 
(b) The excellent mappings are dense tn C™( Ry, Ra). 


Remark 1. The denseness in C=( Ry, Rs) is to be understood in the sense of the topology defin- 


theorem with the considerations described in Subsec. 34.3.2. and with Theorem 34.2.2/2. 
Excellenee is a generic property of mappings of the plane into itself. 


Remark 2. (global version of the theorem). fEC@( Ry, Ro) is globally stable if and only if f is 
excellent and has the following additional property: the images of the C@-curves consisting 
of fold points intersect at most pairwise but not at zero angles. Further, the set of images of 
the fold points and the set of images of the cusp poimts are disjoint. 


eee Unfoldings 
34.5.1. Definition 


Besides the terms ‘‘stable” and “generic”, unfolding is the third fundamental 
concept of singularity theory and catastrophe theory. 
Definition. Jf f€m(x), then FEm(n+r) ts called unfolding of f if F(x, 0)= f(x) 
for x€ Ry and O€ R,. r ts called codimension of the unfolding. 
Remark 1. m(n) and m(n+r) have the same meaning as in Def. 34.1.1/2. Henee it ts also 
clear that all considerations are local, referring to a neighbourhood of 0€R,,,. Thus the 
germ f(x) €m(n) with x€ R, is extended to R,,, by F(x, uyem(n+yr), with ce R, and u€ R,. 
F(x, 0)=f(x) should also be understood in this sense. The values of r may be r=0, 1, 2. ... 


As the codimension r will play a major role later on, we also write (/’, 7) instead of P. There 
is no fear of confusion with F(x, u). 


Remark 2 (examples). (1) If f(z) €m/(n), then 
F(z,u)=f(z) for x€R, and uch, 
is the constant unfolding. (2) If f(x) €m(n), then 


F(a, u) =a) + 
kh 


By(a) wp, dp Een), w=(Uy, +» Ur) , 


Ms 


is an unfolding Fém(n+r) of f. 


Remark 3. The meaning of this concept can be described as follows. In the theory of catas- 
trophes, physical or biological processes that are discontinuous are described by unstable 
germs /€m(n): extremely small C=-perturbations effect substantial changes of the topologi- 
eal strueture of the germs (the unperturbed and the perturbed germ can no longer be trans- 
formed into one another by reparametrization). One attempts to imbed this isolated catas- 
trophe event in an 7-parametric process which is described by a “structurally stable poten- 
tial” F(z, wu). Catastrophes then oecur at isolated points of the otherwise structurally 
stable process. If a catastrophe occurs at a certain position of the Ry and at a certain time, 
then the process can be imbedded in a four-dimensional space-time, r=4. All considerations 
to be made are local, 
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34.5.2. Associated and Equivalent Unfoldings 


It is our aim to find minimal stable unfoldings of a given germ f¢€m/(n). For 
that purpose it is necessary to define several new concepts. Two germs /€ m(n) 
and gém(n) will not be judged essentially different if they are equivalent in the 
sense of Def. 34.1.2(a). 

Definition. Let (F,7) be an unfolding of f€m(n), and let (G,s) be an unfolding 
of gem(n). 

(a) (G, 8) 2s called associated to (F, r) if there exist germs 

Pee(n+s,nt+r) with O(0)=0, wee(s,r) with y(0)=0, 
and 2€m(s) which have the following two properties: 
1. O(r, u)=(y(a, w), plu); cE R,, ue Ry, y(x, u)€ Rn, plu) € R,; 
2. (a, u)=F(y(x, u), p(u)) + a(u). : 

(b) (@, s) and (F,r) are called equivalent (or isomorphic) if (G@, 8) is associated 

to (Fr), r=s, and if p(u) vs a local diffeomorphism in R, and y(x,u), with u 
being fixed, vs a local diffeomorphism in Rp. 
Remark 1. The requirement 1 means that @(x, u) is a fibre preserving mapping. The fibre 
R,, marked by ‘‘w’’ is transformed into the fibre R, marked by “(wu)” (Fig. 34.7). For a fixed 
fibre w one then has y(x, u)€C™(R,y, R,). The requirement 2 means that G can be reduced to 
F’, For u=0 one obtains 


g(x) =G(x, 0) = F(y(x, 0), 0) +0 =f(y(x)). (1) 


Hence, in particular, the germ g can be reduced to the germ f. 


Fig. 34.7 


Remark 2. In the case (b), (1) says that f and g are equivalent in the sense of Def. 34.1.2(a). 
Further, in case (b) it is easy to show that the definition is symmetrical with respect to 
(@, s) and (F, 7). This justifies the terms used. 


Remark 3. If one compares requirement 2 with (1), it appears desirable that a(w) =0. But 
the difference is small, being required for technical reasons. A much more general version is 


G(x, u) =A F(y(x, u), p(u)), u], where A(t, u)€e(1+s), Alt, O) =e . 


which has, for instance, been considered in [65], p. 33. 


34.5.3. Stable and Universal Unfoldings (Definition and Examples) 


Definition (a) An unfolding (F,r) of f€m(n) ts called a stable unfolding of f 
af every unfolding (G, s) of f ts associated to (F, r). 

(b) An unfolding (F,r) of fEm(n) is called a universal unfolding of f if (F,r) 
is a stable unfolding of f and if r is minimal. 
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Remark 1. Hence (/’, r) is called a stable unfolding if every other unfolding (G, s) of / can be 
reduced to (/, r). In particular this means that small perturbations of (Ff, r), provided they 
are unfoldings of /, are reducible to (F’, r). 

Remark 2 (examples). 1 [If f€m(2) is regular, i-e., if f¢ m2(2), then (/’, 0) =f is the universal 
unfolding of / (there is nothing to be unfolded). This follows from Remark 34.1.2/2 (here 
a(u)=0 and y(w)=0 in Def. 34.5.2). Thus the concept of unfolding is only of interest for 
singular germs /€m2(n). 2. For f=x2em2(1), the codimension of the universal unfolding is 
yr =0, just as in the first example. This can in essence be dedueed from Theorem 34.3.2 and 
Remark 34.3.2/4, because x? is a Morse funetion. 3. According to Remark 34.2.1/7, f(a) = 
=x3€m3(1) is not stable. Hence it cannot be expected that 7=0 is the codimension of a uni- 
versal unfolding. It turns out that F(x, vw) =a3 —xu is a universal unfolding of f(z) =a? (thus 
7+=1 is the codimension of a universal unfolding). This is largely an implication of Theorem 
34.4.3, because F(a, w) corresponds to Whitney’s euspidal point of second kind, which is an 
excellent mapping in the plane. 


34.5.4. Stable and Universal Unfoldings (Criteria) 


If F(z, u)em(n+r), where xé€R, and w=(ay,.., u)€R,, is an unfolding of 
f(x) €m(n), then we set 


ui oF 
Vp={ao+ Do Op = CRD real} <e(n) : 
k=1 OU; 


Theorem 1. (a) f€m2(n) has a stable unfolding if and only af f is finitely determined. 
(b) The unfolding (F, r) of f€m(n) is stable if and only of 


\ . 
e(n)=(2)+ Vee . (1) 
: : of : 
Remark 1. Finitely determined germs were considered in Subsec. 34.1.2. ~) was defined 


of 
in Subsec. 34.1.3. Eq. (1) says that it must be attempted to extend a) to e(n) by the 
adjunction of finitely many germs. 


Theorem 2. If f€m2(n) is finitely determined, then cod f, as introduced in Def. 
34.1.3, is the codimension of a universal unfolding of f. 


: at\ 
Remark 2. If g;, .... gr, Where r=cod f, is a basis of miny/(oe) in the sense of Def. 34.1.3, 
then 
3 
F(z, u)=f(z)+ > uxge(%), where Zen, and ¢—(y, -4 Ue 

k>1 
is a universal unfolding of the finitely determined germ {¢€m?(n). This follows from Theo- 
rem I. 
Remark 3. In accordance with the examples given in Remark 34.5. 3/2 one has cod 2?=0 
and cod #3=1 (where »=1). 


34.5.5. Reduction of Unfoldings 


Let G(x, u) be an unfolding of g(x) €m(n), where ué R,. In the later applica- 
tions, #, will be the control space with u as a control parameter, and R,, will be 
the phase space with x as a phase or state parameter. G(r, u) then is the poten- 
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tial which describes the process in question. We are interested to know the 
local relative minima of G(r, wu) with w being fixed (the corresponding x-values 
describe the possible states). As «+0, the system tends towards a catastrophe. 
Thus we look for x-values such that 

ac 8G . | 

= (Alo, 1) rea, eCeL, (1) 
Ox OX 
where all considerations are confined to a neighbourhood of 0¢€ R,4,;. What is 
of real interest are the minima of G(x, u) alone, so that any procedures which do 
not alter the topological structure of the set of all these minima are admissible. 
Let 


Ui ) Ed 
A(x, yy=Glx)+ DB yeem(n+q) - (2) 
k=l 
Meni 2h, yen, we, and ved. 


@ f 
Gla, y, u, v) =G(a, wt 2 Ys (3) 
1 ‘ 
is an unfolding of 4, with the codimension r +s, which is the constant unfolding 
with respect to v€ fs, in the sense of Remark 34.5.1/2. (wu, v)€ R,+5 are now the 
control parameters. The minima of G in (3) for fixed (u, v)€ Ry +s are identical 
with the minima of G, provided that y,;= ... =y,=0. 


Definition. Let (G, r) be an unfolding of g€m(n), and let (F, r+s) be an unfolding 
of fem(n+q). Here q=0 and s=0. 
(a) F reduces toG if F is equivalent toG in (3) (in the sense of an (r + 8)-unfolding). 
(b) @ ts called u proper reduction of F if F reduces toG and q+s~0. 
(¢) F as called irreducible if F has no proper reductions. 


Remark. The starting point lies in the germ / and its unfolding F’. Desired are simple proce- 
dures which on the one hand do not alter the structure of the set of all minima, while redue- 
ing the number of phase parameters and the number of unfolding parameters on the other 
hand. This means that it is desirable to get rid of superfluous parameters which do not 
influence the qualitative picture of the set of all minima. It follows from Remark 34.5.2/2 
that in case (a) the germs / and h in (2) are equivalent (in the sense of Def. 34.1.2 (a)). So one 
attempts to transform /into the form (2) by means of a local diffeomorphism, whereafter one 
can omit the uninteresting parameters y;. ..., y, and confine oneself to g. The second proce- 
dure consists in omitting those unfolding parameters in which the unfolding is constant. 


34.5.6. Minima 


v7 
Simple minimum: If f€m2(r) is equivalent to Q(x) = >, 2; in the sense of Def. 


34.1.2(a), then, by Thcorem 34.1.1, sr 
Bie 
(PV a (ERY (054 ony ty) = (0). 


From Theorem 34.5.4/2 it then follows that f is its own universal unfolding 
(f is a Morse function, cf. Def. 34.3.2). If (7, r) is an unfolding of f, then it is 
easily seen that (/’, 7) is equivalent to the r-dimensional constant unfolding of f. 
In this case F is said to have a simple minimum at the point 0. 
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ee Let (F,r) be an unfolding of fEm(n). Then F has a local minimum 
at OC Ry+, tf for every neighbourhood W of 0€ Ry +, there exists a potnt up R, 
such that, for wy being fixed, the function F(x, u) has a local minimum in 
W 1) {tr = weg} 
= Fr es . . . . . . . it 
Remark. If (#, 7) has a simple minimum (i.c., if fCo(m) is equivalent to > a2), then F has a 
' » . . . . k ze ! 

local minimum in the point 0. In this case one ean always choose uj =0. But what we are 
interested inare just those cases in which (x, 0) has no minimuin in the point 0€ R,, whereas 
there are minima in an arbitrary x.u-neighbourhood of O€ Ry, ,,. A typical example is 


faj=xe8em) and F(x, u) =r'+um . 


34.5.7. Thom’s Theorem 


Theorem. Let f€m2(n) be a finitely determined germ. Let (F, 7) be a stable unfolding 
of f, with codimension r=4, which has a local minimum at 0. Then F either has a 
simple minimum in the potnt 0 or F reduces to one of the following 7 irreducible 
(canonteal) unfoldings Gy of the germs gr 


Name Germ 9x | Unfolding Gy | Codim. 7 
Fold ' 93 v4 ux i 
Cusp x’ ei — yet toe 2 
Swallow tail nee 2+ ux + ye? + wr 3 
Hyperbolic | 
umobilic ae ety +wry —ux—vy ne 
Elliptic ee 5 Pe) : nae 
umbilic ae a ey eee) 3 

| Ux —vY 
Butterfly ie 26 + tart + uxt + vx? + wx 4 
Parabolic Pee. ae ae ‘Z 4 
umbilic ae, x-y 7 a 

—ux —vy 


eee ee 


Remark 1. All the terms used in this theorem have been defined previously: finitely deter- 
mined germs in Def. 34.1.2, stable unfoldings in Def. 34.5.3, the local minimum at 0 and the 
simple minimum in Subsec. 34.5.6., reduction of unfoldings in Subsec. 34.5.5. The three 
umbilies also answer to the following nicknames: wave crest (hyperbolic umbilie), hair 
(elliptie umbilie) and mushroom (parabolic umbilic). The poetic names will become clear 
when we shall draw some figures later on. 


Remark 2. This is Thom’s famous list of the seven elementary catastrophes. They form the 
foundation of catastrophe theory. The faet that G; is a universal unfolding of gy is rather 
easily verified on the basis of Subsce. 34.5.4. What is surprising with this list is its finiteness 
and its independence of the number x of the phase parameters. With the assumptions of the 
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theorem, it turns out that only one or two of the n phase parameters are essential (in the 
sense of the reduction described in Subsec. 34.5.5.). This means that the plausible presump- 
tion that the list will grow longer and longer as » increases is not true. 


Remark 3. The assumptions that (/’, r) is stable and has a local minimum at 0 are natural. 
In the applications, (7, r) is a “structurally stable potential’, and the stable states of the 
system are described by the minima of the potential in the sense of Subsec. 34.5.6. The 
restriction to r =4 is reasonable in biological applications, because in such cases the unfolding 
parameters (control parameters} are identified with a space-time, while x¢€ R,, describes, say, 
the biochemical state of a cell. But ¢ >4 is of interest, too, especially in physical applications. 
What is unclear for the moment is the role played by the assumption that / be finitely deter- 
mined. As we shall see later on, this requirement is of generic nature, being useful for appli- 
cations. 


Remark 4. If one admits other codimensions 7, then the following picture results: 


Codim. r of the unfolding | 1 | 2 3 | 4 Be 
2 = = — 
Number of types el z aa fe pee mile 


Here the simple minimum has not been taken into account. The first cases can be taken from 
the above theorem. 


Remark 6. The first step in the proof of the theorem is the so-called ’splitting lemma‘‘, a 
refinement of the normal forms for Morse functions as per Remark 34.3.2/4: if 0 is a degener- 
ao 
; le 

f(z) is equivalent to D epxup+y(%-44, .., Ly), where r=0, ..., n—1, e, =+1, ype m3(n). 

k=] 

Remark 6. The presentation of this chapter follows the lines of [40]. We further refer to 
[7, 65]. Complete proofs of the above theorem have been given in [68, 78]. , 


ate critical point of f€m-(n), that is, a singular point with act ( (0) =0, then 
ue 


35. Catastrophes: Theory and Application” 


35.1.  Prineiples and Models 
35.1.1. General Principles and Fundamental Ideas 


A detailed consideration of the possibilities of catastrophe theory can be 
found in the famous book of R. Thom [65]. The mathematical basis is the singu- 
larity theory presented in Chap. 34, especially Theorem 34.5.7. The aim of 
catastrophe theory is the mathematical description of discontinuous processes 
(especially in physics and biology), e.g., of the transition of a gas into the liquid 


') René Thom suggested that the combination of singularity theory and its applications 
should be called catastrophe theory. 
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phase, the buckling of an elastic rod, or the specialization of cells in a tissuc. We 
shall order the heuristic considerations to follow by some striking keywords. 
Our aim is to make plausible that Theorem 34.5.7 is a useful eer for 
modelling discontinuous natural processes. 


Potential: Given a physical, chemical or biological system that depends on 
finitely many control parameters u¢ R, (control space), e.g., a thermodynamic - 
system (with temperature and pressure as control parameters) or a living eell 
(with space and time as control parameters). Let the state of the system be 
described by n state parameters x€ 2, (phase space), e.g., the volume of a thermo- 
dynamic system of the biochemical state of a living cell. It is assumed that the 
system can be described by a potential / = F(x, u), which assigns a real value 
F(x, u) to every (admissible) u€ A, in the control space and to every (admissible) 
state r¢€ R,, in the phase space. For a given ué R,, the possible stable states are 
determined by the relative minima of F(x, uw). This is a well-known procedure, 
which is analogous to the classical field theory. Naturally, the requirement that 
a potential exists involves a restriction, which can in part be relaxed if one uses 
the modern qualitative theory of dynamic systems; see for instance [8, 27]. If 


uc R, is fixed, then a potential F(x, u) generates a so-called gradient field, 
dx; oF 
ape, on) anes, oa) [eae AL. cosy VL - 


The following considerations can at least partially be applied to potentials or 
gradient systems or even to more general dynamic systems, 


diy _ 

dt 
cf. [65]. But here we shall confine ourselves to potentials where recourse to 
dynamic systems is not necessary. 


NCO t,t): Kale. at, 


Structural stability: It is reasonable and plausible to require that small pertur- 
bations of the potential F(x, «) of a real system do not change the topological 
character of F(x, u). In particular, small perturbations of the potential are 
assumed to cause only small changes of the possible stable states of the system 
(that is, of the relative minima of F(x, w) with wé R, being fixed). This (some- 
what imprecise) requirement is called structural stability. If a system is described 
by a potential that is not yet explicitly known, then one may try, for example, 
to determine F(x, uw) experimentally. In this case measuring errors will occur. 
Thus it is clear that very. small changes of F(z, u) must not have any substantial 
ene ct: 


Local consideration. The above considerations are of local nature; F(x, wu) 18 
investigated in the neighbourhood of a given point (x, uo) € Rn, Of course we 
can set ry=O0€ R, and u,=O0€ fi, as well as F(0, 0) =0 without loss of generality. 
The structural stability implies that #¢C™(t,,,) 18 @ reasonable requirement. 
Hence F€m(n +r) is a germ in the sense of Def. 34.1.1/2. The control parameters 
u¢ R, vary continuously (which is also a criterion of choice for these parameters), 
whereas the state parameters x€ #,, may also vary discontinuously (jumping 
from one relative minimum to another). In the next subsection we shall describe 
the rules of this jumping in greater detail. We assume that <=0¢€ #, and w= 
=0€R, is an isolated point at which the system exhibits a degenerate beha- 
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viour, while there are relative minima of F(x, w) in the above sense in every 
neighbourhood W of O€ R,+,;. This suggests the following interpretation: 
F(x,u)=(F,r) is that stable unfolding (of codimension r) of a germ f€m(n) 
which has a local minimum at O€ 2,4, in the sense of Def. 34.5.6. Thus the 
term “stable” has a precise mathematical meaning, meeting the sense of the 
above heuristic concept of ‘structural stability”. This conception of stability 
fits the situation very well, because only those deformations are admitted which 
involve separate transformations of the control parameters on the one hand and 
the state parameters on the other. 


Catastrophes: The above system realizes a process: every control parameter uw 
is assigned a state x(w). In the next subsection we shall further comment on 
this assignment. A regular point of the process is a parameter value wp in the 
control space, in the neighbourhood of which x(w) varies continuously as u 
varies continuously. If wv» is not a regular point, it is called a catastrophe point. 
For an unfolding of f€m(n) to have a local minimum at 0, it is necessary that 
f€m2(n) (ef. Def. 34.1.1/2). The case where Ff, as an unfolding of f€m2(n), has a 
simple minimum in the point 0 in the sense of Subsec. 34.5.6. is of no interest 
either. For in this case u=0 is a regular point. 


Elementary catastrophes: If we now assume in addition that f¢m(n) is finitely 
determined and that r=4, then it is possible to apply Theorem 34.5.7 (the 
structure of the relative minima of F(x, x), with uw fixed, is not altered by the 
reduction in the sense of Theorem 34.5.7). In other words, the general catas- 
trophe phenomena can be reduced to 7 basic types, provided that f€m2(2) is 
finitely determined and r=4. What is annoying for the moment is the require- 
ment that f be finitely determined. But we shall see later on in Subsec. 35.2.1. 
that this restriction is of generic character and can always be assumed to be 
fulfilled. The topological situation of catastrophe events is thus completely 
described by Thom’s list, provided that r=4. : 


Global consideration: All the considerations made so far are of local nature, i.e. 
they refer to a neighbourhood of an (x, w) point. A global process has to be 
decomposed into local processes: in the neighbourhood of degenerate points in 
the x,u-space one has to find the associated “catastrophe potentials” (after 
being reduced, they are unfoldings in the sense of Theorem 34.5.7). Subsequently 
one has to combine the local catastrophes smoothly (and in a physically or 
biologically reasonable way). There are no recipes for such procedures. 


Character recognition: Given a process (x(u), uw) of which we believe (or hope) 
that it can be described by a (still unknown) potential F(.r, w) in the sense set 
out above, and that r=4 with respect to the number of control parameters 
u€ h,. Then one may try to determine all catastrophe points vu experimentally, 
that is, all those points uw for which x(w) is possibly discontinuous. In the neigh- 
bourhood of a catastrophe point it can be expected that, in the course of a 
great number of repetitions of suitable experiments, all the stable states possible 
are in fact assumed. Now one can compare the data obtained with the 7 elemen- 
tary catastrophes listed in Theorem 34.5.7 (where reductions, diffeomorphisms 
etc. must be climinated by means of a computer). It can be hoped that in this 
way it is possible to calculate (guess) the catastrophe potential and to determine 
its type. 
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Formalization. If (F, r) = F(x, u) isa stable unfolding of f€ m2(n), then we set 


OF or 
Spa uve IX aye ee, b= eee = ety ») =o}, (1) 
vy OX 
le mer.tac {22.¥" | 
le= w, Ee Sis d st | ——— v = H 
k \(2 ue Xp| de nan, eae u)=0 (2) 
Dp={u€R,| 3r€R, with (2, w)ectp}. 2) 


The stable states of the system with the potential F(x, u) form a subsct of Dp, 
nainely the minima contained in (1). If (2, «) is such a minimum, for which 


o2F \ 
det | ——_ 2 Mea0) , 
ale ( 


then one has a simple minimum as defined in Subsec. 34.5.6. (cf. Remark 
34.3.2/4), Then wu is not a catastrophe point (at least as far as the delay rule to 
be considered in the next subsection is concerned). Here we have to choose our 
words with care, because the Maxwell convention to be stated in Subsec. 35.1.2. 
does not quite fit into this scheme. In any case it is clear, however, that the set 
Dy is of special interest in the investigation of possible catastrophe points. 


Remark. All considerations are local. It is however reasonable to investigate the 7 elemen- 
tary catastrophes listed in Theorem 34.5.7 on a global scale. 


35.1.2. The Local Regime 


As in Subsec. 35.1.1., we consider a (physical, chemical, or- biological) system 
that has a potential F(z, u)€m(n+r) with the control parameters ué R, and 
the state parameters x¢€ R,. As was stated previously, a process realized by this 
system is described by (x(w), u). For the investigation of the possible catastrophe 
phenomena we may confine ourselves to Thom’s list given in Subsec. 34.5.7. 


2, = {juve R, |4x?-2ux+v=0) 
dr ={(x,une 3, 112x?-2u =o) 
O, = (luv) Ry |27v? =8 u}, 


Fig. 35.1 
29 Triebel, Math. Physics 
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Example (the cusp). Let f(v)=2%¢m7(1), F(x, u, v)=2t—uxrtox (Fig. 35.1). 
Then Xp, Jy and Dy have the form indicated in the figure. In the left drawing, 
the u,v-axes are drawn twice, while the x-axis is the saine in both co-ordinate 
systems. The right drawing shows curves of F(z, u) for several parameter values 
of (u,v). (uw, v) are control parameters, x is a phase parameter. The catastrophe 
in question is the second type of the elementary catastrophes of Theorem 34.5.7. 
Here D,y is a semicubical or Neil parabola. In the interior of the Neil parabola 
(shaded area), the function F, with fixed (u, v), has two minima for each para- 
ineter value, one of which disappears on Dy, and in the exterior of the Neil 
parabola there is only one minimum for each curve. The two minima in the 
interior of the Neil parabola are cqual if and only if v=0 (positive u-axis). 


Local regime (Fig. 35.2): Now let F(x, u), where we R, and x¢F,, be an arbi- 
trary potential. The process the system runs through in a neighbourhood of 
x=0, u=0 is described by (x(u), u), where a(u) is a relative minimum of F(z, x) 
for u being fixed (local regime). The question is which minimum is assumed (in 
case there are more than one), and how the system develops in response to a 
variation of w. For this there are two rules: 


Delay rule: For w¢R,, let the system be in the state x(w") (relative minimum). 
If u varies continuously (in a neighbourhood of u), then the system tends to change 
its state continuously, too. It witl remain as long as possible in a state x(u) that 
can be changed continuously into x(w®). Catastrophes (jump-like changes of 
state) will occur if and only if in the variation of « the associated minimum 
a(u) disappears. If, in the above example, (u°, v°)€ Ry is a point of the interior 
shaded area, then catastrophes can occur on Dp (and only there). Whether or 
not a catastrophe will in fact occur when Dy is crossed depends on the particu- 
lar minimum chosen. If D, is crossed from the exterior, then a catastrophe will 
not occur. 


Maxwell convention: For each fixed uc R,, the system always realizes the state 
corresponding to the absolute minimum. Thus catastrophes can occur only at 
those u-values where the absolute minimum is no longer uniquely determined. 
In the above example this is the positive u-axis. 


System development (Fig. 35.3): H the control parameters are varied along 
the path u(r), where t is a curve parameter, then the system runs through the 
states 2=a(u(r)). It is quite useful to fix a path in the u,v-plane of the above 
example and to observe how the system behaves. For the path indicated in the 
figure, catastrophes will occur at 1 and 2, provided that the system obeys the 
delay rule. 


x (u) a Fig. 35.2 
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Remark. At first glance, the Maxwell convention appears somewhat artificial. But it has a 
counterpart in Morse’s theory. For each fixed u¢ Dp, F(x, u) €O>(Ry, f;) is, by Def. 34.3.2, 
a Morse function. But according to Remark 34.3.2/5 the global stability of F(x, wu) is lost if w 
is a point at which the absolute minimum is no longer uniquely determined. Thus the magni- 
tude of the function values also plays a role in singularity theory. 


35.1.3. Examples of Application 


In this subsection we shall describe possible applications, whose detailed 
investigation will, however, be postponed. 


Van der Waals equation: Given a real gas. The 3 characteristic parameters, 
the volume J’, the pressure P, and the teinperature 7, fulfil the van der Waals 
equation of state for 1 mole, 


(P+ ja) bar? 

Here a, b and r are physical constants. One obtains the curve (1) in Fig. 35.4 
for a fixed T below a certain critical temperature. As the pressure is increased 
at constant temperature, the volume decreases, and the gas changes into a 
liquid. The actual process follows curve (2), where the gaseous phase is described 
by section a, the liquid phase by section #. According to Maxewll’s rule the 
straight line in (2) must be so drawn that the two shaded areas are equal. As 
we shall see later on, this corresponds to the Maxwell convention of Subsec. 
39.1.2. The process described in (2) is reversible. When the experimental condi- 
tions are established with extreme care, it is possible to generate an irreversible 
thermodynamic process that corresponds to curve (3). In an analogous way the 
liquid phase can be changed into the gaseous one. By means of catastrophe theory 
it is possible to give a description satisfactory from both qualitative and quanti- 
tative aspects (cf. Subsec. 35.3.1.). The situation suggests regarding P and T 


as control parameters and JV’ as a state parameter. 


Fig. 35.4 


(7) 


Other physical examples. If one investigates a loaded clastic rod, then buckling 
and bulging wilt occur under critical load values. Such as well as similar phenom- 
ena of the theory of elasticity can be treated very well by means of catastrophe 
theory (cf. Subsec. 35.3.2.). The pressure load acting on the rod is the control 
parameter, the deviation from the normal position is the state parameter. In 
geometrical optics, the phenomenon of the caustic (cf. Subsecs. 33.1.3. and 
33.1.4.) can be described by means of the catastrophe theory. We shall not, 
however, discuss this here. In Subsee. 35.3.3. we shall deal with the breaking of 
a water wave when it rolls on the shore. 


aye 
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Biological examples: R. Thom’s book [65] makes propaganda for the application 
of catastrophe theory in biology. In the course of a living organism’s evolution 
the cells specialize. If wé€ Ry is the position of a cell, and if ¢ is the time, then 
(u, t)€ R, can be interpreted as control paramcters. The state of the cell is 
characterized by biochemical parameters. The (wu, é) points at which further 
specializations of the cell are decided are catastrophe points. By analogy with 
the electromagnetic field or the gravitational field, R. Thom says ([65], pp. 
151-152), it is also possible to consider a “life field”. Living organisms are 
then particles, or structurally stable singularities, of this field. All manifestations 
of life then result from the interaction of these particles and from their struggle 
with one another: eating and being eaten. The problem consists in the geomctric 
description of this life field. In Subsec. 35.4.2. we shall discuss two examples: 
the gastrulation of amphibia and the voracity of phagocytes. 

Geometry ef forms: The discussion of the 7 elementary catastrophes leads to 
a certain stock of geometric formations (singularity surfaces etc.). If one consid- 
ers a discontinuous natural process (e.g. geological dislocations, meteorological 
phenomena, medical catastrophes), then one may attempt to reduce the dynamics 
of the process to these basic geometric types. This is closely connected with the 
biological models. We refer to [65]. 

Applications: The catastrophe theory has been applied in medicine, biology, 
thermodynamics, the theory of elasticity, linguistics, meteorology, optics, 
geophysics, sociology and other fields. Apart from the two books mentioned 
above [40, 65] we refer to [50, 73, 77, 87] and [88, Sect. 5.5] in this connection. 
At present, however, it is only the physical applications that have been given 
a mathematically satisfactory treatment. 


35.1.4. The Three Interpretations of Catastrophe Theory 


In reference [21], J. Guckenheimer distinguishes betwcen three interpretations 
of catastrophe theory. 
1. (V.J. Arnold). Catastrophe theory is a singularity theory of surfaces and map- 
pings, i.e., an intramathematical theory whose elements have been presented in 
Chap. 34. This does not exclude selected applications, especially to physical prob- 
lems. 
2. (Rk. Thom). Thom’s point of view is more of philosophical nature. Processes in 
the living and lifeless nature shall be geometrized and represented as a succession 
of (elementary and general) catastrophes. Singularity theory appears as a 
mathematical tool, but is not the heart of the theory. !) 
3. (H.C. Zeeman). The geometric surfaces of singularity theory are used for con- 
crete models in medicine, biology, sociology etc. These models are, on the one 
hand, speculative (as regards the hypotheses used), but on the other hand the 
conclusions drawn from them are conercte enough to allow of criticism (which is 
not always the case in Thom’s approach). A typical example is the model describ- 
ing the aggression of dogs (cf. Subsec. 35.4.3.). It is just these concrete models 
which have made catastrophe theory known beyond the circle of mathematicians 
and physicists. But they have also attracted much criticism; cf. [21]. 


') “Poincaré ... once defined mathematics as the art of giving the same name to different 
things” (E.T. Bell, Men of Mathematics). 
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35.2. Elementary Catastrophes 
35.2.1. The Generic Aspeet 


Let F(x, u), where re R, and wé #,, be a structurally stable potential in the 
sense of Subsec. 35.1.1. We are interested in catastrophe points, where the rules 
stated in Subsec. 35.1.2. shall be taken into account. As stated previously, this 
leads to the following problem. F(.r, w) €m(n +7) is the stable unfolding of a germ 
f€m*(n), with a local minimum at 0€ R,+,. Hf r=4, then it is possible to apply 
Theorem 34.5.7, provided that f is finitely determined. There remains the prob- 
lem of whether or not this assumption (which is necessary according to Theorem 
34.5.4/1) involves an excessive restriction of the set of admissible potentials. 


Theorem. Let r=1, 2, 3, 4. In O-(R,4,, Ry) there exists an open and dense set J, 
whose elements F(x, u)€O"(Rn+,, Ry), rE Ry, wE R,, have the following properties: 
1. Sp, as defined in Subsec. 35.1.1., 7s an r-dimenstonal mantfold. 

2. If (x°, wu) € Lp, then, tn a neighbourhood of (x°, uw), F(x, u) ts a stable unfolding 
of the germ F(x, u°)€C*( Ry, Ry) (with respect to a neighbourhood of x9 in Ry). 


Remark 1. The point (2°, u®) now takes the role of the point (0, 0) in Sec. 34.5. (cf. Def. 
34.5.1). From Theorem 34.5.4/1 it follows that f(z) =F (# +29, u°) €m2(n) is finitely determin- 
ed for FES, and (2°, u°)€Xp. If there is a local minimum, then Thoin’s classification given 


- = 


in Theorem 34.5.7 can also be applied. 


Remark 2.1f a physical process is described by a potential Fo(a, u)€C°(R,,,, Bi), rS4, 
then there exists a potential  €/, in an arbitrary neighbourhood of #9. Hence, in the sense 
of structural stability it is reasonable to require directly that Fy€/J,. 


Remark 3. Two points shall be emphasized once again: (a) Catastrophe phenomena are local 
in nature (even though the elementary types are investigated globally). (b) Thom’s list is 
obtained only after a reduction. Hence for concrete problems one also has to take into 
account constant unfoldings as well as the addition of positive definite quadratic forms. 
For a detailed description of the corresponding mathematical operations, we refer to Appen- 
dix I in [40]. 


35.2.2. Pietures of Elementary Catastrophes 


We are interested in a detailed geometric discussion of the unfoldings / =G;, 
listed in Theorem 34.5.7 as well as of the corresponding formations 2’, 47, and 
D,y mentioned in Subsec. 35.1.1. Especially for the delay rule of Subsec. 35.1.2., 
the possible catastrophe points are contained in Dy. The “cusp” has already been 
examined in this way in Subsec. 35.1.2. Here we shall confine ourselves to select- 
ed examples; the following pictures have been taken from [7] and [65]. 


The fold (Fig. 35.5): 
f=x2emi), F(x, u)=+ur. 
| ee ee 
lx=u=0}, 


The fold is a catastrophe not very interesting. For u<0, F(x, u) has exactly one 
minimum that disappears as u--0. Jumping from one minimum to another does 


not occur. 
30 Triebel, Math. Physics 
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Pig. 35.5 
The swallow-tail (Figs. 35.6 and 35.7): 


f=x5em(1), F(x, u,v, w) =x? + ux + 0x2 + ux , 
J p= {(z, u, v, w)€ R, | 5244+ 3ux2+ 2vx4+w=0}, 
Ar={(a, u, v, w)€ Xp | 20x34 Gux+ 2v=0}, 


we 
Xs) 


Les) 


Dr={(u, v, w)€ Ry [ Sx, where 524+ 3ux2 + 207 + w = 2073 4 6ur + 20=0}. 


The wave-crest (hyperbolic umbilic): 
f=e+yem(2), Fr, y, u,v, w= 234 y+wxry—uxr—vy, 
Lp={(a, y, u,v, w)€ Rs | 3a2+ wy —u =3y2+wr—v=0}, 


| 
Ap=\(2, Y, U,V, W) EL» | Me by =o} : 


Dp= {(u, v, w)€ Be | S(x,y), where u=3224 wy, v=3y2+ wx, w2=36ry}. 


DOeeeee 35.2. Elementary Catustrophes 427 


For a fixed w, 
(x, y)— (eu, ve) = (322+ wy, 3y2+ wz) 


is a C°(Ry, Ry) mapping which transforms the square drawn in the x,y-plane 
into the corresponding figure in the w,v-plane (Fig. 35.8). This figure must not be 
understood to be three-dimensional, but only indicates the different coverings. 
Dy) {w=const} is the image of the hyperbola 36ry=w?. For w=0 this figure 
degenerates: the x,y-square is folded and mapped onto an 4-fold covered uw,v- 
square: the folded handkerchief. Thus the w,v-figure drawn in the diagram is the 
generic image of the folded handkerchief, where the folding takes place along 
the hyperbola 36.ry=w?. 
: u 


Af 
t SExy =Ww? 


UU SS 
The hair (elliptic umbilic) (Fig. 35.9): 
xr? on) 


ad . . 
jay maxy2em2Q), Poe yay vw) =p ry? + wa? ty) —ur—ey, 


Dr={(u, v, w)¢ Ry | A(x, y), where u=2x?—y?+ 2uz, 


v= —2ryt2uy ,2+y2=w 3}. 


ica aed 


Fig. 35.10 
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Dy isa two-dimensional surface in the u,v,w-space. The intersection of Dp with 
w=const is a hypocycloid which can be constructed as follows. A circle of radius 
lewl 


rolls along the inside of a circle of radius |w| (Fig. 35.10). If one fixes a point 


on the smaller circle, then the trace of this point obtained from the rolling circle is 
a hypocycloid. 
The mushroom (parabolic wmbilic) (Fig. 35.11): 
y' y' : 
f=xeyt i Em?(2), F(x, y, u,v, w, ) =a2y+ a + tx? + wy? — ux—vy , 
Lp={(x, y, u,v, w, HE Rg | 2a(y+t)—uH=x24+ y+ 2wy —v=0}, 
2Ay+t) 2x . o} 

22 By2+2w | 3? 

Dp={(u, v, w, NER, | A(x, y), where u=2r(y+2), v=x2+ y3 + Qwy, 

2x? = (y +1)(3y? + 2w)} . 

The mushroom is the most complicated eleinentary catastrophe. Dy is a three- 
dimensional surface in 2. Insertion of x=a(y, ¢, w) into wand v gives u= u(y, t, w) 
and v=v(y, t, w). Then the intersections DpM {w= const, = const} are curves in 
the u,v-plane, with y as a curve parameter. If wand ¢ are so varied that w2+ f= 
=c? (c>0 small), then according to [65], pp. 86-88, one obtains a sequence of 
curves which correspond to the respective domains or curves in the w,-plane 
(Fig. 35.12): if (2, ¢) belongs to domain 1, then the correspondence is to image 1, 
if (w, ¢) lies on curve 6, then the correspondence is to image 6, etc. The most 


VV EM 
YU 
We 


Ap ={(, Y, U,V, W, L)E Lp | 


Fig. 35.11 
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ie ey 


interesting point is that a number of catastrophes considered previously reappear 
as special cases. 1 is a “cusp” (a diffeomorphically distorted Neil parabola); at 2 
there appears a point which subsequently turns into the lip-type catastrophe and 
assumes a mushroom-like shape at 5 (this also accounts for the name). 6 corre- 
sponds to the wave-crest catastrophe (the generic image of the folded handker- 
chief). The adjacent curvilinear triangle is analogous to the hypocycloids of the 
hair catastrophe; the triangle disappears and immediately reappears thereafter. 
Details and formulas are given in [65], pp. 81-90. 


Remark. The origin of the poetic names of the ‘‘swallow-tail’’, the ‘hair’? and the “mush- 
room” becomes clear from a look at the figures. The figure of the butterfly catastrophe is 
similarly suggestive; cf.[7, 65]. Later on we shall return to the interpretation of “wave- 
Guests 


35.3. Applications in Physies 
35.3.1. The van der Waals Equation 


The physical situation has been described in Subsec. 35.1.3. If, in the notation 
used there, P, 7, and V are the pressure, temperature and volume, respectively, 
of a real gas, then the van der Waals equation of state for 1 mole is 


(P+ta)V-H=rP (1) 


It provides a good description outside the range of transition from the liquid to 
the gaseous phase (or vice versa), that is, for the curve sections a and f in Fig. 
35.4(2). We discuss this equation in two variants. 


Ist variant (quantitative). The critical point of (1) is 


a : ; 8a 
EOS 0 OE 
iP 8 V. : 
Setting Dy 1,t= 7 1, and a — 1, one obtains 
323 + (t+ p)x + (8t—2p)=0. 
2 4 . 
The substitution of w= alee +p), v= 3 (8! — 2p) gives 


Ae a — Oe 4) 


430 35. Catastrophes 39.3.1. 


If F= F(x, u,v) =24—ur2+ vx, then F is the nniversal unfolding of {=2*€m2(1) 
according to Theorem 34.5.7, i.e., the cusp. Then (2) agrees with Vy in Subsec. 
35.1.1. Retransformation of F to P, V and 7 yields a function which provides a 
qualitative description of Gibbs’ thermodynamic potential G(V, P, 7’) (free en- 
thalpy) in a neighbourhood of the critical point. The possible stable states of the 
system are described by (2), whcre the modification due to Maxwell’s rule in the 
sense of Fig. 35.4(2) has to be taken into account. In terms of the Gibbs poten- 
tial, Maxwell’s rule reads as follows: for given control parameters P and 7’, the 
system realizes a state V which is given by an absolute minimum of G(V, P, 7) 
(with fixed P and 7’). In the above transformations, this property of G(V, P, T) 
in a neighbourhood of (P., 7'., V.) is transferred to F(x, u, v) in a neighbonrhood 
of 0€ Ry, with uw, » as control parameters and x as a phase parameter. But this is 
the Maxwell convention for the second elementary catastrophe, the cusp. The 
terms “liquid” and “gas” in Fig. 35.13 indicate the liquid and the gaseous phase, 
respectively. The solid arrow indicates the usual catastrophe path with a sudden 
volume change that occurs when a point with »=0 and u=+0 is reached. But the 
starting point and the end point of this process can also be reached on the non- 
catastrophe path indicated by the broken line. The term ‘“‘gaquid” indicates the 
region mentioned previously in connection with Fig. 35.4(3), Subsec. 35.1.3., 
which can be reached under carefully chosen experimental conditions. 


Fig. 35.13 

2nd variant (qualitative). From the point of view of catastrophe theory, there is a 
very elegant, purely qualitative consideration which, to a certain degree of 
necessity, leads to the above picture without requiring knowledge of the van der 
Waals equation of state. This analysis is based on the following hypotheses. 1. The 
gas-liquid mixture has a critical point P,, V,, J’ (which has been established by 
experiment), and the behaviour of the system in the neighbourhood of this point 
is described by a structurally stable catastrophe potential G(V, P, 7). 2. It is 
expedient to regard the pressure P and the temperature 7’ as continuous control 
parameters, and the volume V as a (possibly discontinuous) phase parameter. 
Now, by looking over Thom’s list of Theorem 34.5.7, one finds that there is only 
one possibility to describe this situation, namely the second elementary catas- 
trophe, the cusp. For the local regime one still has two possibilities, the Maxwell 
convention and the delay rule. The experimental data speak in favour of the 
Maxwell convention. 


Conclusion, Catastrophe theory provides a description in the neighbourhood of 
the critical point (P., V., 7c); the van der Waals equation can be used for points 
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faraway from this critical point. The fact that these two entirely different consid- 
erations harmonize so wonderfully is another illustration of the well-known ob- 
servation: 

“Formulae are indeed wiser than men”. 


35.3.2. Eulerian Deformations 


In this point we follow the description given by E. C. Zecman [76]. 


Euler’s bow: Suppose that two rods of equal length are connected by a joint 
(Fig. 35.14). Let the force # act upon the right- and the left-hand end of the bow 
in opposite directions. A possible buckling is counteracted by a spring attached 
to the joint. This enables the system to prevent buckling, provided that # re- 
mains smaller than a critical value f,. For 6+, the jointed rod buckles up or 
down, not jerkily, but rather continuously as # increases continuously. The ex- 
tent of buckling is described by the angle x. (We only consider stable equilibrium 
states, i.e., we exclude the case of unstable equilibrium, x =0 for 6 > £,.) The joint 
of the buckled rod is acted upon by the force «. If z increases, then the system 
will snap into a lower, stable position when a critical value is reached. 


A Pig. 35.14 


Energy balanee: An analysis shows that the system can be described by the po- 
tential 


V(x, x, B) = - w(2x)2 +a sin x—28(1 — cos 2) 


[Ee 


(total energy). Here «+0 is a fixed spring constant. « and # are variable. For 
given « and # the system realizes a state of minimal energy; in particular one 
then has 


iG 


0=—=4yr+2 cosr—26 sine. 
Ox 


This formula shows that # =2u is a specific value for ¢=0 and small values of x 
(which are considered here exclusively). As we shall see, one has 6,=2y. With 
B=2u+6 one obtains 


a ae ; 
V(x, 2, B)=a0—b2? =3 LOGS) (1) 
Be _ —br? a soi 
= ll a(t * br (1 i) +Oe 


Of interest is the behaviour in a neighbourhood of «=b=0. We recall the cusp 
catastrophe and its universal unfolding, as stated in Theorem 34.5.7 and Subsec. 
30.12 ., 

f(a)=xrtem{1), F(x, u, v)=xi— weet or. (2) 
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By means of diffeomorphic mappings, whose concrete form is of no interest 
here, it is now possible to transform (1) into the normal form (2), which means 
that F(a, u, v)~ V(x, «, b+2n), with e~v and b~ xu (where ““~”’ means “approx- 
imately equal”). Hence (1) is a universal (and especially a structurally stable) 


unfolding of the germ x in the sense of Theorem 34.5.7. Thus, in particular, 


I(x, a, 8) qualitatively exhibits the behaviour shown in Subsec. 35.1.2. The local 
regime is determined by the delay rule. A typical example is the path indicated 
in Fig. 35.15. One has «=0 and # <2 along section 1. Here V has exactly one 
minimum at «=0. Along section 2 one has «=0 and B=2u. V has two minima 
inside the (diffeomorphically distorted) Neil parabola. The corresponding (posi- 
tive or negative) x-values indicate the continuous up- or down-buckling. Along 
section 3 the system is considered for increasing a, fixed 6, and x0. According 
to the delay rule the system remains in a state with e+0. This minimum disap- 
pears as the system approaches the point P (compare with the figures for the 
potential in Subsce. 35.1.2.). A catastrophe occurs, the system jumps into the 
remaining stable state with «<0. Along section 4 the system is in a stable state 
coiling =. 0) 


ie Piewaoals 


Euler’s buckling rod (Fig. 35.16): A homogeneous elastic rod with the modulus 
of elasticity « and the length 4 is acted upon by the forces £ from both the right 
and the left end. According to Euler the rod withstands the load until the criti- 


a \2 
cal value B=y =| is reached. Then the rod bows, assuming the shape f(s) = 


=a Sin - +O(x*). Here x is the deflection amplitude (which is assumed to be 


small), s is the arc length, and f(s) is the deflection from the normal position. If 
the bent rod is asymmetrically loaded by «, then initially one obtains a figure 
which corresponds to the sum of two sine curves with the amplitudes x and y 
(Fig. 35.17). Here y= y(«, ¢). As increases the system will approach a critical 
value at which the rod jumps into a lower, stable position. According to Zee- 
man, y=y(z,«) is described by a dual cusp-type catastrophe (Fig. 35.18). 


p 225) ee Fig. 35.16 
a 4004 


f 
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“Dual” means that the stable position is represented by the y-values on the 
shaded intermediate surface. Hf a increases at constant é, then a critical value is 
reached at Q. Then the rod jumps into a new stable position, which is not indi- 
cated in the figure. 


35.3.3. Breaking of Water Waves 


Here we shall follow the presentations of E. C. Zeeman [74] and Thom [65], 
pp. 78-79. A water wave that approaches the shore will break there. The ge- 
ometry of this process can be described adequately by means of the wave-crest 
catastrophe and the associated unfolding according to Theorem 34.5.7 and Sub- 
SG, BDo Loman 


f=H=e+yem(2), F=O+y+wry—ux—vy. 


If one interprets w as atime parameter and wu, vasspace parameters, then accord- 
ing to Subsec. 35.2.2. one obtains the diagrams shown in Fig. 35.19 for 


: Fig. 35.19 


434 35. Catastrophes Bonet. 


Dp {w=const}. For w=0 the curves coincide, while for w= 0 they change parts. 
The change from a round (stable) shape to a cuspidal (unstable) one corresponds 
to the breaking of waves (Fig. 35.20). A generic distortion of this picture, combin- 
ed with propositions of classical hydrodynamics (and a suitable identification 
of the occurring parameters with physical quantities) yields a qualitatively as 
well as quantitatively satisfactory picture (according to Zeeman): at the moment 
of breaking the wave is locally symmetric, the crest angle being 120°; after 
breaking the line section AB is a parabolic are (Fig. 35.21). 


— aN 
A 
ae B 
Fig. 35.21 


35.3.4. Catastrophe Machines 


In this subsection we shall base our considerations on the work of T. Poston 
[33]. Catastrophe machines are explicitly constructed physical (mostly mechani- 
cal) systems that are described by potentials corresponding to the unfoldings of 
the seven elementary catastrophes listed in Theorem 34.5.7. In particular these 
potentials depend on parameters (order of the catastrophe machine). A catas- 
trophe machine of order zero corresponds to the simple ininimum stated in Sub- 
sec. 34.5.6., a catastrophe machine of order one corresponds to the fold-type 
catastrophe, and so on. One may also generalize this concept by considering 
catastrophe machines which depend on infinitely many parameters (or func- 
tions). Examples of this are soap films in wire loops and their catastrophe behav- 
iour when the wire loops are deformed. Here we shall confine ourselves to two 
sunple cases. 


Catastrophe machines of order zero (Fig. 35.22): Consider a strictly convex body 
B with the C~-surface D=8B. Then D is a C=-manifold. If P¢éD, then let 
{(P)cC~(D, FR) be the potential energy of the body in the position in which P 
touches the ground. The body tends to assume a stable position by realizing a 
relative minimum of {(P). Since according to Theorem 34.3.2 (with D instead of 
R,) the Morse functions are dense in C@(D, R,), from the generic point of view one 
may assume that /(P) is a Morse function. The same Theorem 34.3.2, with D 
instead of f,, also implies that every relative minimum is locally stable. After a 
slight deflection the body returns to its stable initial position in an oscillatory 


D 


Fig. 35.22 
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motion. If a body has a symmetry axis, e.g.an egg, then it has unstable equilib- 
rium positions (Fig. 35.23). Such a figure is not generic. This raises the follow- 
ing problem. If a (physical or other) problem exhibits natural symmetries, 
then, from the generic point of view, it appears expedient not to disturb these 
syinmetries. The aim is a singularity theory in which symmetries are taken into 
account, a new version of Thoin’s lst. 


ae, 
» 
Fig. 35.23 
stable small unstable 
equilibrium perturbation equilibrium 


Catastrophe machines of order one: Let f=2x'¢€m2(1) and F=.2x3+ usr be the fold 
catastrophe and its universal unfolding, respectively. It is our aim to find an 
example in the field of mechanics whose potential is essentially equal to F. 
Suppose that a weightless wheel (e.g. of thin sheet metal) of radius 1 is placed on 
an inclined plane with the slope angle a (Fig. 35.24). A heavy weight (e.g. a mag- 
net) is attached to the wheel at a distance c, 0<c<1, from the centre. If c>sin «, 
then there are two equilibrium states: a stable equilibrium with the weight arrang- 
ed at the point A, and an unstable one, where the weight is at the point Bb. For 
c=sin « the system has a singular point, and we set c= —w+sin «. If H(g, wu) is 
the quantity indicated in Fig. 35.25, then the system is in equilibrium if and only 
if H(gy, uv) =0. Of interest is the behaviour in the neighbourhood of the critical 


Fig. 35.24 Fig. 35.25 


point « =¢=0 (Fig. 35.26). One obtains 


H(g, u)=sin «—c cos ¢ =sin x (1—cos yp) +u Cos @ , 


: : sing , 
F(g, u)=sin x (g—sin g)+% sin ara gi+upt..., 


Fig. 35.26 
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oF ; 
where F is a potential with Bo But this is just the unfolding of the fold 
a 


catastrophe. For u <0 there are two g-values such that — = H(g, u)=0. But a 
er Par 


minimum is only obtained for gp, because aa (Po, W=a (Mp, u) > 0. 
We @ 


35.4. Other Applications 
35.4.1. Taylor Series and Cells 


Thom also regards the catastrophe theory as a theory of analogies. From this 
point of view, any two catastrophe events are analogous if they can be described 
by one and the same elementary catastrophe. But these analogies go much far- 
ther, being in part of a very formal nature, as will be shown by the following 
example (cf. [65], pp. 30/31). Let f(x, y)€C=(Ro, Ry), with /(0, 0)=0, where f 
can be formally expanded in a Taylor series, 


eS we, z Cae » Con— 0. 
5,2=0 


As regards the topological nature of f(x, y) €m(2) (normal forms that can be ob- 
tained by local diffeomorphisms), the following can be said: if |a;,y| + laos|>0, 
then, according to Remark 34.3.2/4, the topological structure of f(x, y) is deter- 
mined by a,x +4o0,1y. The normal form is f(x, y) =x. For a, = a,1 = 0 the topolo- 
gical structure is unknown for the present. If 


i 
42,940,2 —A7,1+0, 


then f(x, y) is (locally) a Morse function in the sense of Def. 34.3.2. The topologi- 
cal structure is then determined by dayx?+ 2a, 47y +aoay2, and according to 
Remark 34.3.2/4 the normal forms are f(x, y)= cee Othe other hand, it 
42,0%0,2 — 47, =0, then the topological structure is unclear for the present, and the 
problem must be further investigated. One obtains the following picture: the 
topological structure of the Taylor series with |a, | + lao,1| +0 is determined by 
the linear terms alone, while higher terms have no influence. On the next level 
of expansion, the topological structure of the Taylor series is determined by 
4),0 =d0,1 =9 and a2 a9,. —a7,,+0, and so on. The analogy to cell evolution is as 
follows. At a first stage of evolution the function of certain cells is fixed, so that 
subsequent biochemical perturbations cannot affect the function of these cells. 
Other cells designed for higher tasks have their functions not yet fixed. At a sec- 
ond stage (which largely corresponds with the Morse functions in the above 
considerations) the functions of other cells get fixed and immune to subsequent 
biochemical perturbations. Other cells designed for even higher tasks have 
their functions not yet fixed, and so on. The analogy is clear, though being of a 
very formal nature. 


35.4.2. Applications in Biology 


It is Thom’s idea to draw conclusions about the dynamics of the process from 
the geometry of the catastrophe event (to guess the topological form of the po- 
tential). In particular he applies this method to biological processes. Isolated 
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elementary catastrophes will not suffice for such a program. Oue lias to consider 
collisions of elementary catastrophes, accumulation points of elementary cata- 
strophes, elementary catastrophes occurring along lines and surfaces, etc. This 
leads to highly speculative investigations in evolution biology: specialization 
of cells, formation of bones ete. Without gomg into detail, we shall give two 
examples, which are discussed in [65], pp. 169-171 and pp. 187-189. 


Gastrulation of amphibia: The cells of the embryo sac of amphibia flow into the 
interior of the embryonic sac at a certain point. Part of the sac is introverted, 
forming the blastopore (Fig. 35.27). According to Thom the geometry of this 
process can be described by a swallow-tail catastrophe. 


blastopore 


Phagoeytes: Phagocytes take up foreign matter most easily. The geometry of 
this process can be described by the mushroom catastrophe. The mushroom 
catastrophe (cf. Subsec. 35.2.2.) allows the structurally stable transition from 
the wave-crest catastrophe (hyperbolic umbilic) to the hair catastrophe (elliptic 
umbilic). Following Thom’s presentation ([65], p. 187) (Fig. 35.28), we consider a 
situation which results from a generic deformation of the hair catastrophe (a). As 
the prey approaches, this elliptic state (under tension) turns into a hyperbolic 
one (b). Regime 1 is replaced by the simpler, more relaxed regime 2. The outer 
lips of 1 move outward and close in order to catch something from outside. The 
prey is entrapped in a cavity, and digested (c). Finally the phagocyte returns to 
the elliptic state (d). 


Je 


d) Fig. 35.28 


a) b) 


Remark. These examples are very sketchy and incomplete. But they illustrate one of the 
most essential intents of this approach: to reduce the topology of catastrophe events in 
biology to a few basic types (say, in the sense of Thom’s list). 


35.4.3. Dogs and Mathematicians 


The preceding section gave an impression (if a very rough, incomplete, and 
sketchy one) of the intent of R. Thom. The applications presented by E. C, Zee- 
man are frequently much more concrete, being designed for the geometrization 
of isolated factual situations that depend on a few parameters (which are some- 
times only qualitatively detectable). We shall give two examples: a (bestially) 
earnest one, and another (of human nature) that is meant uot quite so earnestly. 
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Aggression behaviour of dogs (Fig. 35.29): The cusp-type eatastrophe serves as 
the model surface. Fear and rage are the continuous control parameters. The 
behaviour of the dog is the (possibly discontinuous) phase parameter. The pic- 
ture of Zeeman’s dog model has been taken from [21]. 


menacing 


neutral 
~ ee aggressive 
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Mathematicians (Fig. 35.30): Corresponding to this dog model, there is an analo- 
gue (meant not quite seriously) that concerns the mental powers of a mathema- 
tician; it is likewise due to E. C. Zeeman [75]. The model is also based on the 
cusp-type catastrophe. The control parameters are the mathematical and the 
speculative content (of, say, a publication). The phase parameter (which pos- 
sibly varies discontinuously) will answer the question of whether that author is a 
common-or-garden mathematician, a genius, or a fool. Common-or-garden mathe- 
maticians don’t speculate, but a speculating mathematician may very easily turn 
from a genius to a fool. E. C. Zeeman writes: “Thom does not belong to the 
common-or-garden mathematicians, as is shown by his enterprising speculative 
excursions. However closely he sails to the edge, anyhow he always manages to 
stay on the upper surface.” 
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Appendix: Qn the Relation between Geometry and Reality 
during Time’s Change’) 


ie The Emaneipation of Mathematies 


The history of modern mathematies begins with Pythagoras (about 540 B.C.). 
Up to that time mathematics, but especially geometry, had been something like a 
natural science. There were empirieally found “laws’’, verifiable by experiment 
at any time, e.g., geometric theorems for the right-angled triangle etc. Pythago- 
ras was the first to point out that mathematical theorems are based on assump- 
tions (axioms), from which they have to be derived by purely logical deduction. 
He introduced the abstract proof into mathematics: one of the greatest achieve- 
ments in the history of mathematics, which has, however, become almost a 
matter of course today. But Pythagoras deserves mention for another reason, 
too. He and his disciples (the Pythagoreans) believed that the universe is 
“governed” by the natural numbers 1, 2, 3, ... Using simple proeedures (adimit- 
ted by the Pythagoreans) one can derive the negative integers, and hence the 
rational numbers. The second great discovery at the same time meant the end of 
this dream. In our present language it reads: V2 is irrational. This proposition is 
fundamental for the understanding of what modern mathematics is. For up to 
that time all geometric theorems (no matter whether they had been found empiri- 
cally or logically deduced in the sense of Pythagoras) had been verifiable by 
experiment (like laws of natural seienees). The question whether a given straight 
line segment is the rational or irrational multiple of another given line segment 


cannot be decided by experiment. This means that the above fact (that 2 is 
irrational) is an intramathematical theorem that bears no relation to reality 
(provided that reality is assigned the attribute of experimental verifiability). 
Mathematics had emancipated itself: abstract proofs and the existence of intra- 
mathematical theorems gave that stamp to it which makes it stand out among 
the large family of all sciences in a unique way up to this day. 


De Euelid’s Elements 


Fuclid’s Elements date from about 325 B.C.; they represent an attempt to 
axiomatically formulate the geometry of the plane (as far as it concerns points, 
straight lines, and circles). Following the lines of Pythagoras, some few assump- 
tions (axioms) are plaeed at the top, all other theorems being derived from them 
by purely logical deduction. (The fact that, from our present point of view, this 
attempt exhibits gaps does not make any differenee to its supreme importance.) 
The theorems in question in essence concern straight lines, triangles and circles 
in the plane, and have an immediate intuitively geometrie meaning. The empiri- 
cal origin (at least of the great majority) of the geometric theorems is not denied. 


1!) Paper read within the lecture on “History of Mathematics” on the occasion of the 
celebration of the 100th anniversary of A. Einstein’s birth, on March 14th, 1979. 
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The intent is, rather, to systematically order the arsenal of available theorems of 
the geometry of the plane and to reduce them to a few axioms. Here geometry 
becomes an idealized and mathematized reality. 


3. Analytical Geometry 


On June 8th, 1637, Descartes published the work which laid the foundations 
of analytical geometry. The plane, the three-dimensional Euclidean space (and 
its n-dimensional generalization) were equipped with Cartesian co-ordinates 
(which has likewise become a matter of course meanwhile). The intuitive, syn- 
thetically constructive methods of the Greeks were replaced, or at least decisively 
complemented, by analytical methods. This made it possible to describe geo- 
metric figures analytically and to reduce geometric problems to analytical- 
algebraic ones. 


4. Newtou’s Mechanics 


Together with Archimedes and Gauss, Newton is counted among the greatest 
mathematicians that ever lived. His masterpiece, “Philosophiae Naturalis Prin- 
cipia Mathematica”, was published in 1687, counting among the greatest scientif- 
ic books that have ever been written, [46]. It contains, among other things, the 
fundamentals of classical point mechanics, especially an analytical description 
of planetary motion. The three-dimensional Euclidean space and a “uniformly 
flowing” time are regarded as being given. They are, so to speak, divine institu- 
tions which enable Man to make physical investigations. Equipped with the tools 
of analytical geometry, Newton developed his mechanics from a few fundamen- 
tal principles. The impression upon his contemporaries and immediate successors 
must have been enormous. And even today it may well bea good test for a young 
student of mathematics or physics whether or not he becomes enthusiastic at the 
grandiose, self-contained system of classical point mechanics (for it will not do 
without enthusiasm in mathematics). In 1942, on the occasion of the three- 
hundredth anniversary of Newton’s birth, Einstein devoted the following verses 
to his great predecessor. 


“Look to the Heavens, and learn from them 
How one should really honour the Master 

‘The stars in their courses extol Newton’s laws— 
Tn silence eternal.” 


The reality of the three-dimensional Euclidean space as a frame in which physics 
was going on soon became undisputed. (Objections raised by Leibniz were soon 
forgotten. After all, there were no reasons for doubts, for the latter would have 
had to be based on experimentally verifiable facts, at least from the point of 
view of natural science. And there were no such facts at that time.) Geometry, 
especially analytical geometry, was the set of tools to be used for a mathematical 
description of this reality. 
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5. The Parallel Axiom and its Implications 


As was told in Section 2, Euclid has axiomatized the geometry of the plane. 
His axioms are altogether very simple, with the exception of the parallel axiom. 
It reads as follows. // g is a straight line (of infinite extension to both sides), and if 
P is « point that does not lie on q, then there exists one and only one straight line g’ 
(of infinite extension to both sides) which goes through P and does not intersect g 
(Fig. Al). Such a straight line is called a parallel. Over 2000 years mathemati- 


9 Fig. At 


cians tried hard to deduce this complicated axiom from the other, simpler axioms 
of Euclid’s system, but without any success. The situation about 1800 was as 
follows: Newton had been very cautious in his expressions concerning space and 
time. But his successors dogmatized the three-dimensional Euclidean space. It 
was in this space, and only in it, that natural processes could take place. From 
the philosophical side, Kant declared the three-dimensional Euclidean space logi- 
cally necessary. In this situation, the two alternatives to the above parallel 
axiom, namely 

1. There ts no parallel q’ 
and 

2. There are more than one parallels g’ 
sound like heresy. Gauss worked intensively at the parallel axiom, and about 
1816 he (as we know today) arrived at the conception that geometries other than 
the Euclidean are logically possible, too. To put it more precisely: if one replaces 
the parallel axiom in Euclid’s system by either of the two alternatives indicated 
above, then one obtains a consistent mathematical theory, a non-Euclidean 
geometry. Gauss did not publish anything in order to save himself malicious 
criticism. Thus the possibility of non-Euclidean geometries was discovered in- 
dependently of him (and somewhat later) by J. Bolyat and Lobaéevskij. The first 
paper on this subject was published by Lobaéevskij in the “Kasan Herald” in 
1829/30. After the ban had been broken, numerous geometries organized on 
purely intramathematical lines were developed in the second half of the past 
century. Geometry no longer reflected reality; geometry had emancipated itself 
and separated from physics. It had become an intramathematical subdiscipline 
(which does not exclude physical applications). On the other hand it is most 
remarkable that Gauss identified straight lines with rays of light, and that he 
wondered whether reality is in fact described by the Euclidean geometry. Kin- 
stein was to be the first to resume this point. 


6. Differeutial Geometries 


Differential geometries are of special interest for the further considerations. 
If one considers a two-dimensional surface in the three-dimensional HKuclidean 
space, e.g., the surface of a sphere, then one may ask for the intrinsic geometric 
conditions of this surface. Generally these conditions will differ from the familiar 


31 Triebel, Math. Physics 
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ones in the two-dimensional plane. The world of hypothetical two-dimensional 
beings on a spherical surfaee differs from the Euclidean world of corresponding 
two-dimensional beings in the plane. These two-dimensional beings must build 
their geometry on the basis of their (two-dimensional) experience. We, being 
cifted with a third dimension, look at these two-dimensional worlds “‘from above” 
We are the gods of these beings. Our knowledge that this two-dimensional 
world is a surface in the three-dimensional Euclidean space is of no use for these 
beings, because they have no sense for a third dimension. The idea of “intrinsic 
geometries” goes back to Gauss. He largely discussed the intuitive (or, to put it 
otherwise, the real) case of two-dimensional surfaces in the three-dimensional 
Kuelidean space. Here local differential properties of funetions play a deeisive 
role, which also accounts for the name differential geometry. Decisive general- 
izations are due to Riemann, a disciple of Gauss. Differential geometries were in- 
vestigated in arbitrary n-dimensional spaces. The formulations of the problems 
(and, as it will be expected, the answers too) were purely intramathematical. 
Geometry and reality (in the form of physies) had separated definitely, striking 
out for themselves. At least this was the way things looked like at the end of the 
19th century. 


God is cunning, but He is not malicious. 
(A. Einstein) 


Fhe How the Gods Were Robbed of the Time 


About the turn of the century, leading physicists held the view that their 
discipline was on the verge of completion (as far as the theoretieal foundations 
were concerned). The few problems still open were hoped to be soluble within 
the available theories. The mathematics underlying these theories was relatively 
simple (as measured by the level of mathematics of that time, or compared with 
the present inventory of inathematical tools of theoretical physics) and had little 
concern with the problems the majority of the mathematieians were facing at 
that time. One of the still open problems of physics resulted from the different 
space-time structures of Newton’s mechanies and Maxwell’s electrodynamics. 
Newton’s conception of spaee and time (whieh had proved its worth so brilliantly 
in the course of eenturies) was considered unimpeachable. But now it led to 
difficulties. In 1905, Exnstein replaced Newton’s conception of space and time by 
a new one that led to the speeial theory of relativity. The ban was broken, and a 
new conception of spaee and time gained ground. According to Senstem. time 
was absolute, and there was nothing that. had any influence on it. Einstein 
degraded time. It no longer existed a priori, but rather was determined by a 
periodic natural process, i.e., by a physical proeedure. Two observers merely 
stipulate that their “proper times” shall be be determined by one and the same 
physical procedure. Subsequently they can communicate about their proper 
times, comparing them experimentally. In this case the positions of these two 
observers in space and their speeds relative to one another will then play a deci- 
sive role. If these two observers convert their space and time co-ordinates into 
one another, then space and time become intermixed (Lorentz transformations). 
This gave rise to the following comment of Minkowski in 1909: 
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From this hour on, space as such and time as such shall recede to the 
shadows and only a kind of union of the two retain significance.” !) 


Time was, so to speak, stolen from the gods and surrendered to the physicists. As 
a consolation, the gods were allowed to keep hold of the three-dimensional Kucli- 
dean space till 1915. In other words, in the special theory of relativity physics 
likewise takes place in the three-dimensional Euclidean space, which is nnaffected 
by physical processes. The formulae of the special theory of relativity are rela- 
tively simple. It was the interpretation of these formulae which (together with the 
quantum hypothesis) rang in a new era of physics. 


8. The General Theory of Retativity?) 


In 1915, one of the most glorious and ingenious theories ever devised by human 
imagination was completed: Hinste’n’s general theory of relativity. It is a self- 
contained, complicated, highly elegant mathematical-physical theory. Its lan- 
guage is that of the Riemannian geometry (in a generalized sense), which had 
once been believed to have nothing to do with reality. In this theory, Newton’s 
conception of an eternal, really existing three-dimensional Euclidean space 
finally had to be abandoned, too (so the gods are‘unemployed now). The struc- 
ture of space is influenced by physical processes, e.g., by heavy inasses. If one 
considers, for example, the hypothetical two-dimensional beings of Section 6., 
then one can imagine that there are many two-dimensional worlds of such kind, 
with different geometries. A general theory of relativity for these two-dimen- 
sional beings would make, say, the following statement: the surface in which 
these two-dimensional beings are living is not given for ever and a day, but will 
change in the course of time, being influenced by (two-dimensional) physical pro- 
cesses (heavy masses, electromagnetic waves). Our real world is three-dimension- 
al. Since in General Relativity space and time are coupled, one has to add the 
time as the fourth dimension. Thus physical events take place in a four-dimen- 
sional space-time whose structure is determined by a (generalized) Riemannian 
geometry which, in its turn, is influenced by physical processes. 


oF Outlook 


Geometry as a language of physics has proven extremely fruitful. It is just 
in our days that the geometrization of physical processes approaches anothev 
culmination. As one would expect, this is once again connected with an increase 
in complexity of the inventory of mathematical tools used. One of the latest 
developments is the catastrophe theory of the French mathematician hk. Thom. 
His book “Stabilité structurelle et morphogénése”’ (1972) has created a sensation. 
Again it is geometric theories that play a decisive role. The following aphorisin is 
due to Thom: 


!) Wilhelm Busch, in his “Tobias Knopp”, gave the following (unconscious) comment on 
the subject of proper time: ‘“‘One-two-three, in hasty race, time is running, we keep pace. : 
(Well, I have to apologize to serious readers.) 

2) ,,In Einstein’s conception space is no longer the stage on which the drama of physics 
is performed: it is one of the performers® (Sir E. Whittaker, Space and Spirit). 


3i* 
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“Geometry is magic that works ... [swt all magic geometry in the same 
measure as it 1s successful?” 


Is this the Pythagoreans’ mysticism of numbers in the guise of geometry? 
Couldn’t this phrase have been written by Einstein too? 


10. Epilogue 


Prometheus robbed the gods of fire 
and taught man how to use tt. 
Einstein robbed the gods of space-time 
and taught man how to use tt. 
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